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Foreword 


Beginning in the spring of 2000, a series of four one-semester courses 
were taught at Princeton University whose purpose was to present, in 
an integrated manner, the core areas of analysis. The objective was to 
make plain the organic unity that exists between the various parts of the 
subject, and to illustrate the wide applicability of ideas of analysis to 
other fields of mathematics and science. The present series of books is 
an elaboration of the lectures that were given. 

While there are a number of excellent texts dealing with individual 
parts of what we cover, our exposition aims at a different goal: pre- 
senting the various sub-areas of analysis not as separate disciplines, but 
rather as highly interconnected. It is our view that seeing these relations 
and their resulting synergies will motivate the reader to attain a better 
understanding of the subject as a whole. With this outcome in mind, we 
have concentrated on the main ideas and theorems that have shaped the 
field (sometimes sacrificing a more systematic approach), and we have 
been sensitive to the historical order in which the logic of the subject 
developed. 

We have organized our exposition into four volumes, each reflecting 
the material covered in a semester. Their contents may be broadly sum- 
marized as follows: 

I. Fourier series and integrals. 

II. Complex analysis. 

III. Measure theory, Lebesgue integration, and Hilbert spaces. 

IV. A selection of further topics, including functional analysis, distri- 
butions, and elements of probability theory. 

However, this listing does not by itself give a complete picture of 
the many interconnections that are presented, nor of the applications 
to other branches that are highlighted. To give a few examples: the ele- 
ments of (finite) Fourier series studied in Book I, which lead to Dirichlet 
characters, and from there to the infinitude of primes in an arithmetic 
progression; the X-ray and Radon transforms, which arise in a number of 
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problems in Book I, and reappear in Book III to play an important role in 
understanding Besicovitch-like sets in two and three dimensions; Fatou’s 
theorem, which guarantees the existence of boundary values of bounded 
holomorphic functions in the disc, and whose proof relies on ideas devel- 
oped in each of the first three books; and the theta function, which first 
occurs in Book I in the solution of the heat equation, and is then used 
in Book II to find the number of ways an integer can be represented as 
the sum of two or four squares, and in the analytic continuation of the 
zeta function. 

A few further words about the books and the courses on which they 
were based. These courses where given at a rather intensive pace, with 48 
lecture-hours a semester. The weekly problem sets played an indispens- 
able part, and as a result exercises and problems have a similarly im- 
portant role in our books. Each chapter has a series of “Exercises” that 
are tied directly to the text, and while some are easy, others may require 
more effort. However, the substantial number of hints that are given 
should enable the reader to attack most exercises. There are also more 
involved and challenging “Problems”; the ones that are most difficult, or 
go beyond the scope of the text, are marked with an asterisk. 

Despite the substantial connections that exist between the different 
volumes, enough overlapping material has been provided so that each of 
the first three books requires only minimal prerequisites: acquaintance 
with elementary topics in analysis such as limits, series, differentiable 
functions, and Riemann integration, together with some exposure to lin- 
ear algebra. This makes these books accessible to students interested 
in such diverse disciplines as mathematics, physics, engineering, and 
finance, at both the undergraduate and graduate level. 

It is with great pleasure that we express our appreciation to all who 
have aided in this enterprise. We are particularly grateful to the stu- 
dents who participated in the four courses. Their continuing interest, 
enthusiasm, and dedication provided the encouragement that made this 
project possible. We also wish to thank Adrian Banner and Jose Luis 
Rodrigo for their special help in running the courses, and their efforts to 
see that the students got the most from each class. In addition, Adrian 
Banner also made valuable suggestions that are incorporated in the text. 
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We wish also to record a note of special thanks for the following in- 
dividuals: Charles Fefferman, who taught the first week (successfully 
launching the whole project!); Paul Hagelstein, who in addition to read- 
ing part of the manuscript taught several weeks of one of the courses, and 
has since taken over the teaching of the second round of the series; and 
Daniel Levine, who gave valuable help in proof-reading. Last but not 
least, our thanks go to Gerree Pecht, for her consummate skill in type- 
setting and for the time and energy she spent in the preparation of all 
aspects of the lectures, such as transparencies, notes, and the manuscript. 

We are also happy to acknowledge our indebtedness for the support 
we received from the 250th Anniversary Fund of Princeton University, 
and the National Science Foundation’s VIGRE program. 
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Preface to Book I 


Any effort to present an overall view of analysis must at its start deal 
with the following questions: Where does one begin? What are the initial 
subjects to be treated, and in what order are the relevant concepts and 
basic techniques to be developed? 

Our answers to these questions are guided by our view of the centrality 
of Fourier analysis, both in the role it has played in the development of 
the subject, and in the fact that its ideas permeate much of the present- 
day analysis. For these reasons we have devoted this first volume to an 
exposition of some basic facts about Fourier series, taken together with 
a study of elements of Fourier transforms and finite Fourier analysis. 
Starting this way allows one to see rather easily certain applications to 
other sciences, together with the link to such topics as partial differential 
equations and number theory. In later volumes several of these connec- 
tions will be taken up from a more systematic point of view, and the ties 
that exist with complex analysis, real analysis, Hilbert space theory, and 
other areas will be explored further. 

In the same spirit, we have been mindful not to overburden the begin- 
ning student with some of the difficulties that are inherent in the subject: 
a proper appreciation of the subtleties and technical complications that 
arise can come only after one has mastered some of the initial ideas in- 
volved. This point of view has led us to the following choice of material 
in the present volume: 

• Fourier series. At this early stage it is not appropriate to intro- 
duce measure theory and Lebesgue integration. For this reason 
our treatment of Fourier series in the first four chapters is carried 
out in the context of Riemann integrable functions. Even with this 
restriction, a substantial part of the theory can be developed, de- 
tailing convergence and summability; also, a variety of connections 
with other problems in mathematics can be illustrated. 

• Fourier transform. For the same reasons, instead of undertaking 
the theory in a general setting, we confine ourselves in Chapters 5 
and 6 largely to the framework of test functions. Despite these lim- 
itations, we can learn a number of basic and interesting facts about 
Fourier analysis in and its relation to other areas, including the 
wave equation and the Radon transform. 
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xii PREFACE TO BOOK I 

• Finite Fourier analysis. This is an introductory subject par excel- 
lence, because limits and integrals are not explicitly present. Nev- 
ertheless, the subject has several striking applications, including 
the proof of the infinitude of primes in arithmetic progression. 

Taking into account the introductory nature of this first volume, we 
have kept the prerequisites to a minimum. Although we suppose some 
acquaintance with the notion of the Riemann integral, we provide an 
appendix that contains most of the results about integration needed in 
the text. 

We hope that this approach will facilitate the goal that we have set 
for ourselves: to inspire the interested reader to learn more about this 
fascinating subject, and to discover how Fourier analysis affects decisively 
other parts of mathematics and science. 
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1 


The Genesis of Fourier 
Analysis 


Regarding the researches of d’Alembert and Euler could 
one not add that if they knew this expansion, they 
made but a very imperfect use of it. They were both 
persuaded that an arbitrary and discontinuous func- 
tion could never be resolved in series of this kind, and 
it does not even seem that anyone had developed a 
constant in cosines of multiple arcs, the first problem 
which I had to solve in the theory of heat. 

J. Fourier, 1808-9 


In the beginning, it was the problem of the vibrating string, and the 
later investigation of heat flow, that led to the development of Fourier 
analysis. The laws governing these distinct physical phenomena were 
expressed by two different partial differential equations, the wave and 
heat equations, and these were solved in terms of Fourier series. 

Here we want to start by describing in some detail the development 
of these ideas. We will do this initially in the context of the problem of 
the vibrating string, and we will proceed in three steps. First, we de- 
scribe several physical (empirical) concepts which motivate correspond- 
ing mathematical ideas of importance for our study. These are: the role 
of the functions cos t, sin t, and e lt suggested by simple harmonic mo- 
tion; the use of separation of variables, derived from the phenomenon 
of standing waves; and the related concept of linearity, connected to the 
superposition of tones. Next, we derive the partial differential equation 
which governs the motion of the vibrating string. Finally, we will use 
what we learned about the physical nature of the problem (expressed 
mathematically) to solve the equation. In the last section, we use the 
same approach to study the problem of heat diffusion. 

Given the introductory nature of this chapter and the subject matter 
covered, our presentation cannot be based on purely mathematical rea- 
soning. Rather, it proceeds by plausibility arguments and aims to provide 
the motivation for the further rigorous analysis in the succeeding chap- 
ters. The impatient reader who wishes to begin immediately with the 
theorems of the subject may prefer to pass directly to the next chapter. 
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2 Chapter 1. THE GENESIS OF FOURIER ANALYSIS 

1 The vibrating string 

The problem consists of the study of the motion of a string fixed at 
its end points and allowed to vibrate freely. We have in mind physical 
systems such as the strings of a musical instrument. As we mentioned 
above, we begin with a brief description of several observable physical 
phenomena on which our study is based. These are: 

• simple harmonic motion, 

• standing and traveling waves, 

• harmonics and superposition of tones. 

Understanding the empirical facts behind these phenomena will moti- 
vate our mathematical approach to vibrating strings. 


Simple harmonic motion 

Simple harmonic motion describes the behavior of the most basic oscil- 
latory system (called the simple harmonic oscillator), and is therefore 
a natural place to start the study of vibrations. Consider a mass {m} 
attached to a horizontal spring, which itself is attached to a fixed wall, 
and assume that the system lies on a frictionless surface. 

Choose an axis whose origin coincides with the center of the mass when 
it is at rest (that is, the spring is neither stretched nor compressed), as 
shown in Figure 1. When the mass is displaced from its initial equilibrium 



Figure 1 . Simple 



position and then released, it will undergo simple harmonic motion. 
This motion can be described mathematically once we have found the 
differential equation that governs the movement of the mass. 

Let y{t) denote the displacement of the mass at time t. We assume that 
the spring is ideal, in the sense that it satisfies Hooke’s law: the restoring 
force F exerted by the spring on the mass is given by F — —ky(t). Here 
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1. The vibrating string 3 

k > 0 is a given physical quantity called the spring constant. Applying 
Newton’s law (force = mass x acceleration), we obtain 

-ky(t) = my"(t), 

where we use the notation y" to denote the second derivative of y with 
respect to t. With c = ^ k/m , this second order ordinary differential 
equation becomes 

( 1 ) y"{t) + c 2 y{t) = 0 . 

The general solution of equation (1) is given by 

y(t) = a cos ct + b sin ct , 


where a and b are constants. Clearly, all functions of this form solve 
equation (1), and Exercise 6 outlines a proof that these are the only 
(twice differentiable) solutions of that differential equation. 

In the above expression for y(t), the quantity c is given, but a and b 
can be any real numbers. In order to determine the particular solution 
of the equation, we must impose two initial conditions in view of the 
two unknown constants a and b. For example, if we are given y( 0) and 
y'{ 0), the initial position and velocity of the mass, then the solution of 
the physical problem is unique and given by 


y(t) = ?/(0) cos ct + 


2 /( 0 ) 


sin ct . 


One can easily verify that there exist constants A > 0 and ip E M such 
that 


a cos ct + b sin ct — A cos (ct — tp) . 

Because of the physical interpretation given above, one calls A = \/a 2 + b 2 
the “amplitude” of the motion, c its “natural frequency,” tp its “phase” 
(uniquely determined up to an integer multiple of 27 t), and 2i t/c the 
“period” of the motion. 

The typical graph of the function Acos(ct-ip), illustrated in 
Figure 2, exhibits a wavelike pattern that is obtained from translating 
and stretching (or shrinking) the usual graph of cos t. 

We make two observations regarding our examination of simple har- 
monic motion. The first is that the mathematical description of the most 
elementary oscillatory system, namely simple harmonic motion, involves 
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Chapter 1. THE GENESIS OF FOURIER ANALYSIS 



Figure 2. The graph of Acos(ct — ip) 


the most basic trigonometric functions cos t and sinf. It will be impor- 
tant in what follows to recall the connection between these functions 
and complex numbers, as given in Euler’s identity e lt — cost + i sinf. 
The second observation is that simple harmonic motion is determined as 
a function of time by two initial conditions, one determining the position, 
and the other the velocity (specified, for example, at time t — 0). This 
property is shared by more general oscillatory systems, as we shall see 
below. 

Standing and traveling waves 

As it turns out, the vibrating string can be viewed in terms of one- 
dimensional wave motions. Here we want to describe two kinds of mo- 
tions that lend themselves to simple graphic representations. 

• First, we consider standing waves. These are wavelike motions 
described by the graphs y = u(x, t) developing in time t as shown 
in Figure 3. 

In other words, there is an initial profile y = ip(x) representing the 
wave at time t = 0, and an amplifying factor depending on t, 
so that y = u(x, t) with 

u(x, t) = ip(x)i(j(t). 

The nature of standing waves suggests the mathematical idea of 
“separation of variables,” to which we will return later. 

• A second type of wave motion that is often observed in nature is 
that of a traveling wave. Its description is particularly simple: 
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5 



Figure 3. A standing wave at different moments in time: t — 0 and 
t = t 0 


there is an initial profile F(x) so that u(x, t) equals F( x) when 
t = 0. As t evolves, this profile is displaced to the right by ct units, 
where c is a positive constant, namely 


u(x, t ) = F{ x — ct). 


Graphically, the situation is depicted in Figure 4. 



Figure 4. A traveling wave at two different moments in time: t — 0 and 
t = t 0 


Since the movement in t is at the rate c, that constant represents the 
velocity of the wave. The function F[x — ct) is a one-dimensional 
traveling wave moving to the right. Similarly, u(x,t) = F(x + ct) 
is a one-dimensional traveling wave moving to the left. 
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Harmonics and superposition of tones 

The final physical observation we want to mention (without going into 
any details now) is one that musicians have been aware of since time 
immemorial. It is the existence of harmonics, or overtones. The pure 
tones are accompanied by combinations of overtones which are primar- 
ily responsible for the timbre (or tone color) of the instrument. The idea 
of combination or superposition of tones is implemented mathematically 
by the basic concept of linearity, as we shall see below. 


We now turn our attention to our main problem, that of describing the 
motion of a vibrating string. First, we derive the wave equation, that is, 
the partial differential equation that governs the motion of the string. 


1.1 Derivation of the wave equation 

Imagine a homogeneous string placed in the (x, y)-plane, and stretched 
along the x-axis between x = 0 and x = L. If it is set to vibrate, its 
displacement y = u(x, t ) is then a function of x and t, and the goal is to 
derive the differential equation which governs this function. 

For this purpose, we consider the string as being subdivided into a 
large number N of masses (which we think of as individual particles) 
distributed uniformly along the x-axis, so that the n th particle has its 
x-coordinate at x n — nL/N . We shall therefore conceive of the vibrat- 
ing string as a complex system of N particles, each oscillating in the 
vertical direction only, however, unlike the simple harmonic oscillator we 
considered previously, each particle will have its oscillation linked to its 
immediate neighbor by the tension of the string. 



x 


n— 1 


X n 

V 

V 

h 


Xn -\- 1 
J 


Figure 5. A vibrating string as a discrete system of masses 
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We then set y n {t) = u(x n ,t), and note that x n+ i — x n = h, with h — 
L/N. If we assume that the string has constant density p > 0, it is 
reasonable to assign mass equal to ph to each particle. By Newton’s law, 
phy”(t) equals the force acting on the n th particle. We now make the 
simple assumption that this force is due to the effect of the two nearby 
particles, the ones with x-coordinates at x n -i and x n +i (see Figure 5). 
We further assume that the force (or tension) coming from the right of 
the n th particle is proportional to (y n + 1 — Vn)/h, where h is the distance 
between x n + i and x n ; hence we can write the tension as 

(^) (Vn+ 1 - Vn), 

where r > 0 is a constant equal to the coefficient of tension of the string. 
There is a similar force coming from the left, and it is 

( j {Vn—l ~ Vn)- 

Altogether, adding these forces gives us the desired relation between the 
oscillators y n (t), namely 

(2) phy'nit) = ^ {Vn+iit) + y n -i{t) - 2 y n (t)} . 

On the one hand, with the notation chosen above, we see that 

Vn+i(t) + y n -i(t) ~ 2y n (t) = u(x n + h,t) + u(x n - h,t) - 2 u(x n ,t). 

On the other hand, for any reasonable function F(x) (that is, one that 
has continuous second derivatives) we have 

F{x + h) + F(x-h)-2F(x) ^ Fll{x) 

Thus we may conclude, after dividing by h in (2) and letting h tend to 
zero (that is, N goes to infinity), that 

d 2 u _ d 2 u 
P dt 2 dx 2 ’ 

or 

1 d 2 u d 2 u 
c 2 dt 2 dx 2 ’ 

This relation is known as the one-dimensional wave equation, or 
more simply as the wave equation. For reasons that will be apparent 
later, the coefficient c > 0 is called the velocity of the motion. 


with c = \Jt / p. 
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In connection with this partial differential equation, we make an im- 
portant simplifying mathematical remark. This has to do with scaling, 
or in the language of physics, a “change of units.” That is, we can think of 
the coordinate x as x = aX where a is an appropriate positive constant. 
Now, in terms of the new coordinate X, the interval 0 < x < L becomes 
0 < X < L/a. Similarly, we can replace the time coordinate t by t = bT , 
where b is another positive constant. If we set U(X, T) = u{x , t), then 

dU du d 2 U 2 <9 2 u 

dX ° dx ' dX 2 a dx 2 ’ 

and similarly for the derivatives in t. So if we choose a and b appropri- 
ately, we can transform the one-dimensional wave equation into 

d 2 U d 2 U 

gf2 - QX2’ 

which has the effect of setting the velocity c equal to 1. Moreover, we have 
the freedom to transform the interval 0<x<Tto0<X < n. (We shall 
see that the choice of 7 r is convenient in many circumstances.) All this 
is accomplished by taking a = L/ n and b — L/(cn). Once we solve the 
new equation, we can of course return to the original equation by making 
the inverse change of variables. Hence, we do not sacrifice generality by 
thinking of the wave equation as given on the interval [0, 7r] with velocity 
c = 1. 


1.2 Solution to the wave equation 

Having derived the equation for the vibrating string, we now explain two 
methods to solve it: 

• using traveling waves, 

• using the superposition of standing waves. 

While the first approach is very simple and elegant, it does not directly 
give full insight into the problem; the second method accomplishes that, 
and moreover is of wide applicability. It was first believed that the second 
method applied only in the simple cases where the initial position and 
velocity of the string were themselves given as a superposition of standing 
waves. However, as a consequence of Fourier’s ideas, it became clear that 
the problem could be worked either way for all initial conditions. 
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Traveling waves 

To simplify matters as before, we assume that c = 1 and L — tt, so that 
the equation we wish to solve becomes 


d 2 u 


d 2 u 
dx 2 


on 0 < x < tt. 


The crucial observation is the following: if F is any twice differentiable 
function, then u(x,t) = F(x + t) and u(x,t) = F(x — t ) solve the wave 
equation. The verification of this is a simple exercise in differentiation. 
Note that the graph of u(x, t) = F(x — t ) at time t — 0 is simply the 
graph of F. and that at time t — 1 it becomes the graph of F translated 
to the right by 1. Therefore, we recognize that F(x — t) is a traveling 
wave which travels to the right with speed 1. Similarly, u(x, t) = F(x + t) 
is a wave traveling to the left with speed 1. These motions are depicted 
in Figure 6. 



Our discussion of tones and their combinations leads us to observe 
that the wave equation is linear. This means that if u(x,t) and v(x,t) 
are particular solutions, then so is au(x,t) + (3v(x, t). where a and (3 
are any constants. Therefore, we may superpose two waves traveling in 
opposite directions to find that whenever F and G are twice differentiable 
functions, then 

u(x, t) = F(x + 1) + G(x — t ) 

is a solution of the wave equation. In fact, we now show that all solutions 
take this form. 

We drop for the moment the assumption that 0 < x < n, and suppose 
that u is a twice differentiable function which solves the wave equation 
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for all real x and t. Consider the following new set of variables £ = x + t, 
?7 = x — t, and define v(£, rj) = u(x, t). The change of variables formula 
shows that v satisfies 

d 2 v 

d£dr/ 

Integrating this relation twice gives v(£,r]) — F(£) + G(rj), which then 
implies 


u(x , t) = F(x + t) + G(x — t), 
for some functions F and G. 

We must now connect this result with our original problem, that is, 
the physical motion of a string. There, we imposed the restrictions 0 < 
x < 7r, the initial shape of the string u(x, 0) = f(x), and also the fact 
that the string has fixed end points, namely u(0,t) — u(n,t) = 0 for all 
t. To use the simple observation above, we first extend / to all of R by 
making it odd 1 on [ — 7r, 7 t], and then periodic 2 in x of period 27T, and 
similarly for u(x,t ), the solution of our problem. Then the extension u 
solves the wave equation on all of R, and u(x, 0) = f(x) for all x € R. 
Therefore, u(x, t ) = F(x + t) + G(x — t), and setting t = 0 we find that 

F(x) + G{x) = f{x). 

Since many choices of F and G will satisfy this identity, this suggests 
imposing another initial condition on u (similar to the two initial condi- 
tions in the case of simple harmonic motion), namely the initial velocity 
of the string which we denote by g{x)\ 

dii 

— (x,0) = g{x), 

where of course g(0) = g(n) = 0. Again, we extend g to R first by mak- 
ing it odd over [ — 7r, 7t] , and then periodic of period 27t. The two initial 
conditions of position and velocity now translate into the following sys- 
tem: 

f F{x) + G{x) = f(x ) , 

1 F’(x)-G’(x) = g(x). 


1 A function / defined on a set U is odd if —x E U whenever x E U and f(—x) = —f(x), 
and even if f(—x) = f(x). 

2 A function / on R is periodic of period uj if f(x + lj) = f(x) for all x. 
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Differentiating the first equation and adding it to the second, we obtain 


2 F'(x) = f(x) + g(x). 


Similarly 


2 G'(x) = f'{x) - g(x), 


and hence there are constants C\ and C 2 so that 


F(x) 


/(*) + 



g(y) dy 


+ c i 


and 


G(x) 




ff(y) dy 


+ c 2 . 


Since .F(a;) + G(x) = f(x) we conclude that C\ + C 2 = 0, and therefore, 
our final solution of the wave equation with the given initial conditions 
takes the form 

u(x , t) = \ [f(x + t) + f(x ~t)} + ]- f g(y) dy. 
z 1 Jx-t 


The form of this solution is known as d’Alembert’s formula. Observe 
that the extensions we chose for / and g guarantee that the string always 
has fixed ends, that is, u{ 0, t) = u(n, t) — 0 for all t. 

A final remark is in order. The passage from t > 0 to t £ M, and then 
back to t > 0, which was made above, exhibits the time reversal property 
of the wave equation. In other words, a solution u to the wave equation 
for t > 0, leads to a solution u~ defined for negative time t < 0 simply 
by setting u~(x,t) = u(x, — t ), a fact which follows from the invariance 
of the wave equation under the transformation 1 1 — > — t. The situation is 
quite different in the case of the heat equation. 


Superposition of standing waves 

We turn to the second method of solving the wave equation, which is 
based on two fundamental conclusions from our previous physical obser- 
vations. By our considerations of standing waves, we are led to look for 
special solutions to the wave equation which are of the form (p(x)ip(t). 
This procedure, which works equally well in other contexts (in the case 
of the heat equation, for instance), is called separation of variables 
and constructs solutions that are called pure tones. Then by the linearity 
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of the wave equation, we can expect to combine these pure tones into a 
more complex combination of sound. Pushing this idea further, we can 
hope ultimately to express the general solution of the wave equation in 
terms of sums of these particular solutions. 

Note that one side of the wave equation involves only differentiation 
in x, while the other, only differentiation in t. This observation pro- 
vides another reason to look for solutions of the equation in the form 
u(x,t) = ip(x)ip(t) (that is, to “separate variables”), the hope being to 
reduce a difficult partial differential equation into a system of simpler 
ordinary differential equations. In the case of the wave equation, with u 
of the above form, we get 

ip(x)ip"{t) = ip"(x)ip(t), 

and therefore 

(p"(x) 
ip(t) <p{x) ' 

The key observation here is that the left-hand side depends only on t, 
and the right-hand side only on x. This can happen only if both sides 
are equal to a constant, say A. Therefore, the wave equation reduces to 
the following 

f - A = 0 
’ \ <p"(x) — \<p(x) — 0. 

We focus our attention on the first equation in the above system. At 
this point, the reader will recognize the equation we obtained in the 
study of simple harmonic motion. Note that we need to consider only 
the case when A < 0, since when A > 0 the solution i/j will not oscillate 
as time varies. Therefore, we may write A — — m 2 , and the solution of 
the equation is then given by 

ip{t) — A cos mt + B sin mt. 

Similarly, we find that the solution of the second equation in (3) is 
ip(x) = A cos mx + B sin mx. 

Now we take into account that the string is attached at x — 0 and x = it. 
This translates into </?(0) = = 0, which in turn gives A — 0, and 

if B 0, then m must be an integer. If m = 0, the solution vanishes 
identically, and if m < — 1, we may rename the constants and reduce to 
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the case m > 1 since the function sin y is odd and cos y is even. Finally, 
we arrive at the guess that for each m > 1, the function 

u m (x,t) — ( A m cos mt + B m sin mt) sinma;, 

which we recognize as a standing wave, is a solution to the wave equa- 
tion. Note that in the above argument we divided by ip and ip, which 
sometimes vanish, so one must actually check by hand that the standing 
wave u m solves the equation. This straightforward calculation is left as 
an exercise to the reader. 

Before proceeding further with the analysis of the wave equation, we 
pause to discuss standing waves in more detail. The terminology comes 
from looking at the graph of u m (x,t) for each fixed t. Suppose first that 
m = 1, and take u(x,t) — cost sin x. Then, Figure 7 (a) gives the graph 
of u for different values of t. 



Figure 7. Fundamental tone (a) and overtones (b) at different moments 
in time 


The case m — 1 corresponds to the fundamental tone or first har- 
monic of the vibrating string. 

We now take m — 2 and look at u(x,t ) = cos2tsin2x. This corre- 
sponds to the first overtone or second harmonic, and this motion is 
described in Figure 7 (b). Note that u(n/2,t) = 0 for all t. Such points, 
which remain motionless in time, are called nodes, while points whose 
motion has maximum amplitude are named anti- nodes. 

For higher values of m we get more overtones or higher harmonics. 
Note that as m increases, the frequency increases, and the period 2n/m 
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decreases. Therefore, the fundamental tone has a lower frequency than 
the overtones. 

We now return to the original problem. Recall that the wave equation 
is linear in the sense that if u and v solve the equation, so does era + (3v 
for any constants a and f3. This allows us to construct more solutions 
by taking linear combinations of the standing waves u m . This technique, 
called superposition, leads to our final guess for a solution of the wave 
equation 


OO 

(4) u(x,t) — ^ (A m cos mt + B m sin mt) sin rax. 

m = 1 


Note that the above sum is infinite, so that questions of convergence 
arise, but since most of our arguments so far are formal, we will not 
worry about this point now. 

Suppose the above expression gave all the solutions to the wave equa- 
tion. If we then require that the initial position of the string at time 
t — 0 is given by the shape of the graph of the function / on [0, 7r], with 
of course /( 0) = f(ir) = 0, we would have u(x,0) = /(x), hence 


OO 

A m smmx = f(x). 

771=1 


Since the initial shape of the string can be any reasonable function /, we 
must ask the following basic question: 

Given a function / on [0,7r] (with /( 0) = f(n) = 0), can we 
find coefficients A m so that 


OO 

(5) f(x) — ^ A m sin mx ? 

m=l 


This question is stated loosely, but a lot of our effort in the next two 
chapters of this book will be to formulate the question precisely and 
attempt to answer it. This was the basic problem that initiated the 
study of Fourier analysis. 

A simple observation allows us to guess a formula giving A m if the 
expansion (5) were to hold. Indeed, we multiply both sides by sinnx 
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and integrate between [0,7r]; working formally, we obtain 

/*7 T /»7T / OO \ 

/ f{x)smnxdx— / y A m sin mx ) sin nx dx 
J 0 \m = 1 / 


= E A m / sin m.T sin nx dx = A n ■ — , 


m = 1 


where we have used the fact that 

, f 0 if m 7^ n, 
sin mx sm nxdx = < r 

^ 7t/ 2 it m = n. 

Therefore, the guess for A n , called the n th Fourier sine coefficient of /, 
is 

2 f 7 " 

(6) = — / f(x) sin nxdx. 

K Jo 

We shall return to this formula, and other similar ones, later. 

One can transform the question about Fourier sine series on [0, 7r] to 
a more general question on the interval [ — 7r, 7t] . If we could express / 
on [0, 7r] in terms of a sine series, then this expansion would also hold on 
[ — 7r, 7r] if we extend / to this interval by making it odd. Similarly, one 
can ask if an even function g(x) on [ — 7r, 7r] can be expressed as a cosine 
series, namely 

OO 

9{x) = ^2 A' m cosrnx. 

m = 0 

More generally, since an arbitrary function F on [ — 7r, tt\ can be expressed 
as / + g, where / is odd and g is even, 3 we may ask if F can be written 
as 

OO OO 

F(x) = E A m sinmx + E A' rn cos mx . 

ra=l m = 0 

or by applying Euler’s identity e lx = cos x + i sin x, we could hope that 
F takes the form 

OO 

F(x)= E a mF mx - 

771 = — OO 



3 Take, for example, f(x) = [F(x) — F(—x )]/ 2 and g(x) = [-^(ir) + F(—x)]/ 2. 
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By analogy with (6), we can use the fact that 


1 

2tt 



^imx ^—inx 


dx 


0 if n m 

1 if tt, — m, 


to see that one expects that 




1 

2? r 



F{x)e~ inx 


dx. 


The quantity a n is called the n th Fourier coefficient of F. 

We can now reformulate the problem raised above: 

Question: Given any reasonable function F on [— 7r, 7 t], with 
Fourier coefficients defined above, is it true that 


OO 

(7) F(x)= Y, a rne imx ? 

m =— oo 

This formulation of the problem, in terms of complex exponentials, is 
the form we shall use the most in what follows. 

Joseph Fourier (1768-1830) was the first to believe that an “arbitrary” 
function F could be given as a series (7). In other words, his idea was 
that any function is the linear combination (possibly infinite) of the most 
basic trigonometric functions sinmx and cos mx, where m ranges over 
the integers. 4 Although this idea was implicit in earlier work, Fourier had 
the conviction that his predecessors lacked, and he used it in his study 
of heat diffusion; this began the subject of “Fourier analysis.” This 
discipline, which was first developed to solve certain physical problems, 
has proved to have many applications in mathematics and other fields as 
well, as we shall see later. 

We return to the wave equation. To formulate the problem correctly, 
we must impose two initial conditions, as our experience with simple 
harmonic motion and traveling waves indicated. The conditions assign 
the initial position and velocity of the string. That is, we require that u 
satisfy the differential equation and the two conditions 

Q u 

u(x, 0) = f(x) and — (x, 0) = g(x), 


4 The first proof that a general class of functions can be represented by Fourier series 
was given later by Dirichlet; see Problem 6, Chapter 4. 
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where / and g are pre- assigned functions. Note that this is consistent 
with (4) in that this requires that f and g be expressible as 


OO OO 

/(*) = H j4 m sinrna; and g(x) = ^ mB ?n sin mx. 

m= 1 m = 1 

1.3 Example: the plucked string 

We now apply our reasoning to the particular problem of the plucked 
string. For simplicity we choose units so that the string is taken on the 
interval [0, 7r], and it satisfies the wave equation with c — 1. The string is 
assumed to be plucked to height h at the point p with 0 < p < 7r; this is 
the initial position. That is, we take as our initial position the triangular 
shape given by 


/(*) 


xh 



for 0 < x < p 

P 


h( 7T — x) 



for p < x < tv. 


TT -p 


which is depicted in Figure 8. 



Figure 8. Initial position of a plucked string 


We also choose an initial velocity g(x) identically equal to 0. Then, we 
can compute the Fourier coefficients of / (Exercise 9), and assuming that 
the answer to the question raised before (5) is positive, we obtain 


OO 

/(*) = ^ A m sin mx with A m 

m = 1 


2 h sin mp 
m 2 p(tt — p) 
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Thus 


OO 

(8) u(x,t) — ^ A m cos mt sin mx, 

m = 1 


and note that this series converges absolutely. The solution can also be 
expressed in terms of traveling waves. In fact 


(9) 


u(x, t) 


f(x + t) + f(x - t) 
2 


Here f(x) is defined for all x as follows: first, / is extended to [— n, 7r] by 
making it odd, and then / is extended to the whole real line by making 
it periodic of period 2tt, that is, f(x + 27r k) = f(x) for all integers k. 

Observe that (8) implies (9) in view of the trigonometric identity 

cos v sin u — — sin(u + v) + sin(u — u)] . 

As a final remark, we should note an unsatisfactory aspect of the so- 
lution to this problem, which however is in the nature of things. Since 
the initial data f(x) for the plucked string is not twice continuously dif- 
ferentiable, neither is the function u (given by (9)). Hence u is not truly 
a solution of the wave equation: while u(x, t) does represent the position 
of the plucked string, it does not satisfy the partial differential equation 
we set out to solve! This state of affairs may be understood properly 
only if we realize that u does solve the equation, but in an appropriate 
generalized sense. A better understanding of this phenomenon requires 
ideas relevant to the study of “weak solutions” and the theory of “dis- 
tributions.” These topics we consider only later, in Books III and IV. 


2 The heat equation 

We now discuss the problem of heat diffusion by following the same 
framework as for the wave equation. First, we derive the time-dependent 
heat equation, and then study the steady-state heat equation in the disc, 
which leads us back to the basic question (7). 

2.1 Derivation of the heat equation 

Consider an infinite metal plate which we model as the plane IR 2 , and 
suppose we are given an initial heat distribution at time t = 0. Let the 
temperature at the point (x,y) at time t be denoted by u(x,y,t). 
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Consider a small square centered at (xo,yo) with sides parallel to the 
axis and of side length h, as shown in Figure 9. The amount of heat 
energy in S at time t is given by 


H(t) — o t Jj u(x,y,t) dxdy . 


where a > 0 is a constant called the specific heat of the material. There- 
fore, the heat flow into S is 


did 

Ik 


— a 



du 

dt 


dx dy , 


which is approximately equal to 

, 2 dll 

ah ~Q^{xo,yo,t), 

since the area of S is h 2 . Now we apply Newton’s law of cooling, which 
states that heat flows from the higher to lower temperature at a rate 
proportional to the difference, that is, the gradient. 


h 


f 


< 

+ 


(*0; 2/0 ) 

V 





(zo + h/2,y 0 ) 


Figure 9. Heat flow through a small square 


The heat flow through the vertical side on the right is therefore 

du 

-Kh -(xo + h/2,y 0 ,t ) , 

where k > 0 is the conductivity of the material. A similar argument for 
the other sides shows that the total heat flow through the square S is 
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given by 


K,h 


du Ou 

— (x 0 + h/2,y 0 ,t) - t^(x 0 - h/2,y 0 ,t) 

du du 

+ -^{x o, yo + h/2, t) - —(X 0 ,y 0 - h/2, t) 


Applying the mean value theorem and letting h tend to zero, we find 
that 

c 7 du d 2 u d 2 u 
n dt dx 2 dy 2 ’ 

this is called the time-dependent heat equation, often abbreviated 
to the heat equation. 


2.2 Steady-state heat equation in the disc 

After a long period of time, there is no more heat exchange, so that 
the system reaches thermal equilibrium and du/dt = 0. In this case, 
the time-dependent heat equation reduces to the steady-state heat 
equation 


(10) 


d 2 u d 2 u 
dx 2 + dy 2 


The operator d 2 /dx 2 + d 2 /dy 2 is of such importance in mathematics and 
physics that it is often abbreviated as A and given a name: the Laplace 
operator or Laplacian. So the steady-state heat equation is written as 


Au = 0, 


and solutions to this equation are called harmonic functions. 
Consider the unit disc in the plane 

D = {(x, y) G R 2 : x 2 + y 2 < 1}, 

whose boundary is the unit circle C. In polar coordinates (r,0), with 
0 < r and 0 < 0 < 2i r, we have 

D = {(r, 9) : 0 < r < 1} and C = {(r, 9) : r — 1}. 

The problem, often called the Dirichlet problem (for the Laplacian 
on the unit disc), is to solve the steady-state heat equation in the unit 
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disc subject to the boundary condition u = f on C. This corresponds to 
hxing a predetermined temperature distribution on the circle, waiting a 
long time, and then looking at the temperature distribution inside the 
disc. 



Figure 10. The Dirichlet problem for the disc 


While the method of separation of variables will turn out to be useful 
for equation (10), a difficulty comes from the fact that the boundary 
condition is not easily expressed in terms of rectangular coordinates. 
Since this boundary condition is best described by the coordinates (r, 6 ) , 
namely it(l, 6) — f(9), we rewrite the Laplacian in polar coordinates. An 
application of the chain rule gives (Exercise 10): 

d 2 u 1 du 1 d 2 u 
U dr 2 r dr r 2 d6 2 

We now multiply both sides by r 2 , and since A u — 0, we get 

2 d 2 u du d 2 u 

dr 2 + dr d0 2 

Separating these variables, and looking for a solution of the form 

u(r,9) = F(r)G(9), we find 

r 2 F"(r) +rF'(r) _ G"{6) 

W) ~~~gW 
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Since the two sides depend on different variables, they must both be 
constant, say equal to A. We therefore get the following equations: 

f G"(0) + A G(9) = 0 , 

\ r 2 F"{r) + rF'(r ) — A F(r) = 0. 

Since G must be periodic of period 27r, this implies that A > 0 and (as 
we have seen before) that A = m 2 where m is an integer; hence 

G(9) = A cos m,9 + B sin m9. 

An application of Euler’s identity, e lx — cos x + * sin x, allows one to 
rewrite G in terms of complex exponentials, 

G(9) = Ae ime + Be~ imd . 

With A = m 2 and m/0, two simple solutions of the equation in F are 
F(r) — r m and F(r) = r~ m (Exercise 11 gives further information about 
these solutions). If m = 0, then F(r) = 1 and F(r) — logr are two solu- 
tions. If m > 0, we note that r~ m grows unboundedly large as r tends 
to zero, so F{r)G{9 ) is unbounded at the origin; the same occurs when 
m — 0 and F(r) = logr. We reject these solutions as contrary to our 
intuition. Therefore, we are left with the following special functions: 

u m (r,9) = r Me im0 , m € Z. 

We now make the important observation that (10) is linear, and so as 
in the case of the vibrating string, we may superpose the above special 
solutions to obtain the presumed general solution: 

OO 

u(r,6)= Y, 

m =— oo 

If this expression gave all the solutions to the steady-state heat equation, 
then for a reasonable / we should have 

OO 

«(M)= E ame ime = f(9). 

m =— oo 

We therefore ask again in this context: given any reasonable function / 
on [ 0 , 27 t] with /( 0) = f(2n), can we find coefficients a m so that 

OO 

m = E ? 


m =— oo 
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Historical Note: D’Alembert (in 1747) first solved the equation of the 
vibrating string using the method of traveling waves. This solution was 
elaborated by Euler a year later. In 1753, D. Bernoulli proposed the 
solution which for all intents and purposes is the Fourier series given 
by (4), but Euler was not entirely convinced of its full generality, since 
this could hold only if an “arbitrary” function could be expanded in 
Fourier series. D’Alembert and other mathematicians also had doubts. 
This viewpoint was changed by Fourier (in 1807) in his study of the 
heat equation, where his conviction and work eventually led others to a 
complete proof that a general function could be represented as a Fourier 
series. 

3 Exercises 

1. If 2 = x + iy is a complex number with we define 

N = {x 2 + y 2 ) l/2 

and call this quantity the modulus or absolute value of 2 . 

(a) What is the geometric interpretation of |;r|? 

(b) Show that if \z\ = 0, then 2 = 0. 

(c) Show that if AgR, then |Ar| = |A|| 2 |, where |A| denotes the standard 
absolute value of a real number. 

(d) If Z\ and Zi are two complex numbers, prove that 

\z\Zi\ = |^i||^ 2 | and \zi + z 2 \ < |zi| + \z 2 \. 


(e) Show that if z 0, then |1/2| = 1/|2 


2. If 2 = x + iy is a complex number with i,t/Gl, we define the complex 
conjugate of z by 


z = x — iy. 

(a) What is the geometric interpretation of 2 ? 

(b) Show that \z\ 2 = zz. 

(c) Prove that if 2 belongs to the unit circle, then 1/2 = 2 . 
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3. A sequence of complex numbers {'W n }^=-i is said to converge if there exists 
w £ C such that 


lirn \w n — w| = 0, 


and we say that w is a limit of the sequence. 

(a) Show that a converging sequence of complex numbers has a unique limit. 

The sequence is said to be a Cauchy sequence if for every e > 0 there 

exists a positive integer N such that 


w n — Wm | < e whenever n, to > N. 


(b) Prove that a sequence of complex numbers converges if and only if it is a 
Cauchy sequence. [Hint: A similar theorem exists for the convergence of a 
sequence of real numbers. Why does it carry over to sequences of complex 
numbers?] 

A series X^?T=i Zn °f complex numbers is said to converge if the sequence formed 
by the partial sums 


N 



converges. Let (a„ }“ =1 be a sequence of non-negative real numbers such that 
the series a„ converges. 

(c) Show that if { z n J-jJLj is a sequence of complex numbers satisfying 
\z n \ < o, n for all n, then the series ^ z n converges. [Hint: Use the Cauchy 
criterion.] 

4. For z G C, we define the complex exponential by 



71 = 0 


(a) Prove that the above definition makes sense, by showing that the series 
converges for every complex number z. Moreover, show that the conver- 
gence is uniform 5 on every bounded subset of C. 

(b) If zi, Z 2 are two complex numbers, prove that e zi e z 2 = e Zl+Z2 . [Hint: Use 
the binomial theorem to expand (z i + z 2 )™, as well as the formula for the 
binomial coefficients.] 


5 A sequence of functions {fn(z)}^ =1 is said to be uniformly convergent on a set S if 
there exists a function / on S so that for every e > 0 there is an integer N such that 
| fn(z) — f(z ) | < e whenever n > N and z S S. 
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(c) Show that if z is purely imaginary, that is, z = iy with y G ]R, then 

e ly = cos y + isiny. 

This is Euler’s identity. [Hint: Use power series.] 

(d) More generally, 

e x+ly _ £ x ^ cog y _|_ j g j n y'j 
whenever and show that 

\e x+iy \ = e x . 


(e) Prove that e 2 = 1 if and only if z = 2i rki for some integer k. 

(f) Show that every complex number z = x + iy can be written in the form 


z = re 


ie 


where r is unique and in the range 0 < r < oc, and 9 G R is unique up to 
an integer multiple of 27 t. Check that 

r = |^r| and 9 = arctan(y/o:) 

whenever these formulas make sense. 

(g) In particular, i = e 17r ' 2 . What is the geometric meaning of multiplying a 
complex number by i? Or by e ™ for any 9 gK? 

(h) Given 0 gR, show that 

„ e ie + e~ iB „ e iB - e~ iB 

cos 9 = and sin 9 = . 

2 2 i 

These are also called Euler’s identities. 

(i) Use the complex exponential to derive trigonometric identities such as 

cos(0 + 'd)= cos 9 cos 1 } — siu 9 sin i9, 
and then show that 

2 sin (9 sin = cos(9 — ip) — cos(9 + ip ) , 

2 sin 9 cos 95 = sin($ + tp) + sin(0 — ip). 

This calculation connects the solution given by d’Alembert in terms of 
traveling waves and the solution in terms of superposition of standing 


waves. 
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5. Verify that f(x) = e lnx is periodic with period 2-7T and that 


— [ e inx dx = 

2rr 


1 if n = 0, 
0 if n ^ 0. 


Use this fact to prove that if n, m > 1 we have 
cos nx cos mx dx = 


1 

7T 


0 if n ^ m, 

1 n = m. 


and similarly 


1 

TV 


sin nx sin mx dx = 


0 if n m, 

1 n = m. 


Finally, show that 


/ sin nx cos mx dx = 0 for any n,m. 

J — 7T 

[Hint: Calculate e inx e- imx + e inx e imx and e inx e~ 


6. Prove that if / is a twice continuously differentiable function on R which is 
a solution of the equation 

f"(t)+c 2 f(t) = 0, 

then there exist constants a and b such that 

f(t) = a cos ct + fosinct. 

This can be done by differentiating the two functions g(t) = f(t ) cos ct — c _1 /'(t) sin ct 
and h(t) = f(t) sin ct + c -1 /'(f) cos ct. 


7 . Show that if a and b are real, then one can write 

a cos ct + b sin ct = A cos (ct — ip ) , 
where A = v a 2 + b 2 , and <p is chosen so that 


cos ip = 


V a? + 6 2 


and sin = 


\J a 2 + b 2 


8. Suppose F is a function on (a, b) with two continuous derivatives. Show that 
whenever x and x + h belong to (a, b), one may write 

h 2 

F( x + h) = F(x) + hF'(x) + —F"(x) + h 2 <p(h ) , 
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where tp(h) — * 0 as h — * 0. 

Deduce that 

F(x + h) + F( x — h) — 2F(x) 


[Hint: This is simply a Taylor expansion. It may be obtained by noting that 


fX-\-h 


F( x + h)~ F{ x) = 


F'iy) dy, 


and then writing F'(y) = F'(x) + (y — x)F"(x) + {y — x)ip(y — x), where tp(h) — > 
0 as h — > 0.J 


9. In the case of the plucked string, use the formula for the Fourier sine coeffi- 
cients to show that 

2h sin mp 
m m 2 p(n — p) 

For what position of p are the second, fourth, . . . harmonics missing? For what 
position of p are the third, sixth, . . . harmonics missing? 


10. Show that the expression of the Laplacian 


A = 


d 2 d 2 
dx 2 dy 2 


is given in polar coordinates by the formula 

d 2 Id Id 2 
dr 2 r dr r 2 d6 2 

Also, prove that 


du 

2 

du 

2 

du 

2 1 

du 

dx 

+ 

dy 


dr 

r 2 

dd 


11. Show that if n E Z the only solutions of the differential equation 
r 2 F" (r) + rF'(r) - n 2 F(r) = 0, 

which are twice differentiable when r > 0, are given by linear combinations of 
r n and r~ n when n / 0, and 1 and logr when n = 0. 

[Hint: If F solves the equation, write F(r) = g(r)r n , find the equation satisfied 
by g , and conclude that rg'(r ) + 2 ng(r) = c where c is a constant.] 
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l\ 


1 


U = fi 


U = 0 


A u = 0 


u = 0 


0 


u = fo 


7 r 


Figure 11. Dirichlet problem in a rectangle 


4 Problem 

1. Consider the Dirichlet problem illustrated in Figure 11. 

More precisely, we look for a solution of the steady-state heat equation 
A u = 0 in the rectangle R = {(a;, y) : 0 < x < 7r, 0 < y < 1} that vanishes on 
the vertical sides of R , and so that 


u(x,0) = f o (x) and u(x, 1) = /i (x ) , 


where fo and /i are initial data which fix the temperature distribution on the 
horizontal sides of the rectangle. 

Use separation of variables to show that if f 0 and f± have Fourier expansions 



and 



k = i 


k = 1 


then 



We recall the definitions of the hyperbolic sine and cosine functions: 


sinh x = 


and cosh x = - — — 


2 


Compare this result with the solution of the Dirichlet problem in the strip ob- 
tained in Problem 3, Chapter 5. 
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Basic Properties of Fourier 
Series 


Nearly fifty years had passed without any progress on 
the question of analytic representation of an arbitrary 
function, when an assertion of Fourier threw new light 
on the subject. Thus a new era began for the de- 
velopment of this part of Mathematics and this was 
heralded in a stunning way by major developments in 
mathematical Physics. 

B. Riemann, 1854 


In this chapter, we begin our rigorous study of Fourier series. We set 
the stage by introducing the main objects in the subject, and then for- 
mulate some basic problems which we have already touched upon earlier. 

Our first result disposes of the question of uniqueness: Are two func- 
tions with the same Fourier coefficients necessarily equal? Indeed, a 
simple argument shows that if both functions are continuous, then in 
fact they must agree. 

Next, we take a closer look at the partial sums of a Fourier series. Using 
the formula for the Fourier coefficients (which involves an integration), 
we make the key observation that these sums can be written conveniently 
as integrals: 

^ J D n (x - y)f(y ) dy, 

where {D^} is a family of functions called the Dirichlet kernels. The 
above expression is the convolution of / with the function D? y. Convo- 
lutions will play a critical role in our analysis. In general, given a family 
of functions {K n }, we are led to investigate the limiting properties as n 
tends to infinity of the convolutions 

^ J K n (x - y)f(y) dy. 

We find that if the family {K n } satisfies the three important properties 
of “good kernels,” then the convolutions above tend to f(x) as n — > oo 
(at least when / is continuous). In this sense, the family { K n } is an 
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“approximation to the identity.” Unfortunately, the Dirichlet kernels 
Dn do not belong to the category of good kernels, which indicates that 
the question of convergence of Fourier series is subtle. 

Instead of pursuing at this stage the problem of convergence, we con- 
sider various other methods of summing the Fourier series of a function. 
The first method, which involves averages of partial sums, leads to con- 
volutions with good kernels, and yields an important theorem of Fejer. 
From this, we deduce the fact that a continuous function on the circle 
can be approximated uniformly by trigonometric polynomials. Second, 
we may also sum the Fourier series in the sense of Abel and again en- 
counter a family of good kernels. In this case, the results about convo- 
lutions and good kernels lead to a solution of the Dirichlet problem for 
the steady-state heat equation in the disc, considered at the end of the 
previous chapter. 


1 Examples and formulation of the problem 

We commence with a brief description of the types of functions with 
which we shall be concerned. Since the Fourier coefficients of / are 
defined by 


L 

f{x)e~ 2ninx/L dx, for n G Z, 

where / is complex- valued on [0, L ] , it will be necessary to place some in- 
tegr ability conditions on /. We shall therefore assume for the remainder 
of this book that all functions are at least Riemann integrable. 1 Some- 
times it will be illuminating to focus our attention on functions that 
are more “regular,” that is, functions that possess certain continuity or 
differentiability properties. Below, we list several classes of functions in 
increasing order of generality. We emphasize that we will not generally 
restrict our attention to real- valued functions, contrary to what the fol- 
lowing pictures may suggest; we will almost always allow functions that 
take values in the complex numbers C. Furthermore, we sometimes think 
of our functions as being defined on the circle rather than an interval. 
We elaborate upon this below. 



1 Limiting ourselves to Riemann integrable functions is natural at this elementary stage 
of study of the subject. The more advanced notion of Lebesgue integrability will be taken 
up in Book III. 
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Everywhere continuous functions 

These are the complex-valued functions / which are continuous at every 
point of the segment [0,L], A typical continuous function is sketched in 
Figure 1 (a). We shall note later that continuous functions on the circle 
satisfy the additional condition /( 0) = f(L). 


Piecewise continuous functions 

These are bounded functions on [0, L] which have only finitely many 
discontinuities. An example of such a function with simple discontinuities 
is pictured in Figure 1 (b). 



Figure 1. Functions on [0, L\: continuous and piecewise continuous 


This class of functions is wide enough to illustrate many of the the- 
orems in the next few chapters. However, for logical completeness we 
consider also the more general class of Riemann integrable functions. 
This more extended setting is natural since the formula for the Fourier 
coefficients involves integration. 

Riemann integrable functions 

This is the most general class of functions we will be concerned with. 
Such functions are bounded, but may have infinitely many discontinu- 
ities. We recall the definition of integrability. A real-valued function / 
defined on [0, L\ is Riemann integrable (which we abbreviate as in- 
tegrable 2 ) if it is bounded , and if for every e > 0, there is a subdivision 
0 = xq < x\ < ■ ■ ■ < xjv-i < aw = L of the interval [0, L\, so that if U 


2 Starting in Book III, the term “integrable” will be used in the broader sense of 
Lebesgue theory. 


Ibookroot October 20, 2007 


32 Chapter 2. BASIC PROPERTIES OF FOURIER SERIES 

and C are, respectively, the upper and lower sums of / for this subdivi- 
sion, namely 

N 

U = [ SU P f( x )\( x j - X 3- 1) 

j = \ Xj — l<X<Xj 


and 


N 

C = inf f(x)](xj - Xj- 1) , 

j=i J J 

then we have U — C < e. Finally, we say that a complex- valued function 
is integrable if its real and imaginary parts are integrable. It is worthwhile 
to remember at this point that the sum and product of two integrable 
functions are integrable. 

A simple example of an integrable function on [0, 1] with infinitely 
many discontinuities is given by 

{ 1 if 1 /( 71 + 1) < x < l/n and n is odd, 

0 if 1 /( 71 + 1) < x < l/n and n is even, 

0 if x — 0. 

This example is illustrated in Figure 2. Note that / is discontinuous 
when x = l/n and at x = 0. 


0 


111 

543 


1 

2 


+-3*- 

1 


Figure 2. A Riemann integrable function 


More elaborate examples of integrable functions whose discontinuities 
are dense in the interval [0, 1] are described in Problem 1. In general, 
while integrable functions may have infinitely many discontinuities, these 
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functions are actually characterized by the fact that, in a precise sense, 
their discontinuities are not too numerous: they are “negligible,” that is, 
the set of points where an integrable function is discontinuous has “mea- 
sure 0.” The reader will find further details about Riemann integration 
in the appendix. 

From now on, we shall always assume that our functions are integrable, 
even if we do not state this requirement explicitly. 

Functions on the circle 

There is a natural connection between 27r-periodic functions on M like the 
exponentials e m0 , functions on an interval of length 27r, and functions on 
the unit circle. This connection arises as follows. 

A point on the unit circle takes the form e lS , where 9 is a real number 
that is unique up to integer multiples of 27r. If F is a function on the 
circle, then we may define for each real number 9 

m = F(e i6 ), 

and observe that with this definition, the function / is periodic on IR of 
period 27r, that is, f(0 + 2n) = f(8) for all 9. The integrability, continu- 
ity and other smoothness properties of F are determined by those of /. 
For instance, we say that F is integrable on the circle if / is integrable 
on every interval of length 27r. Also, F is continuous on the circle if / 
is continuous on IR, which is the same as saying that / is continuous on 
any interval of length 27r. Moreover, F is continuously differentiable if / 
has a continuous derivative, and so forth. 

Since / has period 27r, we may restrict it to any interval of length 27r, 
say [0, 27 t] or [— tt, 7t], and still capture the initial function F on the circle. 
We note that / must take the same value at the end-points of the interval 
since they correspond to the same point on the circle. Conversely, any 
function on [0, 27t] for which /( 0) = f(2n) can be extended to a periodic 
function on IR which can then be identified as a function on the circle. 
In particular, a continuous function / on the interval [0, 2 tt] gives rise to 
a continuous function on the circle if and only if /( 0) = /( 2n). 

In conclusion, functions on IR that 27r-periodic, and functions on an 
interval of length 27t that take on the same value at its end-points, are 
two equivalent descriptions of the same mathematical objects, namely, 
functions on the circle. 

In this connection, we mention an item of notational usage. When 
our functions are defined on an interval on the line, we often use x as 
the independent variable; however, when we consider these as functions 
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on the circle, we usually replace the variable x by 0. As the reader will 
note, we are not strictly bound by this rule since this practice is mostly 
a matter of convenience. 

1.1 Main definitions and some examples 

We now begin our study of Fourier analysis with the precise definition of 
the Fourier series of a function. Here, it is important to pin down where 
our function is originally defined. If / is an integrable function given on 
an interval [a, b\ of length L (that is, b — a = L), then the n th Fourier 
coefficient of f is defined by 

/(n) = y [ b f(x)e~ 2ninx / L dx, n G Z. 

L J a 

The Fourier series of / is given formally 3 by 

OO 

E f(n)e 2winx/L . 

n=— oo 

We shall sometimes write a n for the Fourier coefficients of /, and use the 
notation 

OO 

/ (®)~ E a n e 2ninx / L 

n=— oo 

to indicate that the series on the right-hand side is the Fourier series of 
/• 

For instance, if / is an integrable function on the interval [— 7r, 7 r], then 
the n th Fourier coefficient of / is 

/(n) = a n = J” f(e)e~ ine <W, n e Z, 

and the Fourier series of / is 

OO 

/(<?)- E a ^ ine - 

n=— oo 

Here we use 9 as a variable since we think of it as an angle ranging from 
— 7T to 7T. 


3 At this point, we do not say anything about the convergence of the series. 
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Also, if / is defined on [0, 27 t], then the formulas are the same as 
above, except that we integrate from 0 to 27 t in the definition of the 
Fourier coefficients. 

We may also consider the Fourier coefficients and Fourier series for a 
function defined on the circle. By our previous discussion, we may think 
of a function on the circle as a function / on E which is 27r-periodic. 
We may restrict the function f to any interval of length 27 t, for instance 
[0,27 t] or [— 7r, 7r], and compute its Fourier coefficients. Fortunately, / is 
periodic and Exercise 1 shows that the resulting integrals are independent 
of the chosen interval. Thus the Fourier coefficients of a function on the 
circle are well defined. 

Finally, we shall sometimes consider a function g given on [0, 1]. Then 

pl oo 

g{n) = a n = / g(x)e~~ 2nmx dx and g(x) ~ a n e 27rma: . 

n=—oc 

Here we use x for a variable ranging from 0 to 1. 

Of course, if / is initially given on [0, 2n\, then g(x) = f(2irx) is defined 
on [0, 1] and a change of variables shows that the n th Fourier coefficient 
of / equals the n th Fourier coefficient of g. 

Fourier series are part of a larger family called the trigonometric se- 
ries which, by definition, are expressions of the form c n e 2nmx / L 

where c n £ C. If a trigonometric series involves only finitely many non- 
zero terms, that is, c n = 0 for all large |n|, it is called a trigonometric 
polynomial; its degree is the largest value of |n| for which c n ^ 0. 

The N th partial sum of the Fourier series of /, for N a positive 
integer, is a particular example of a trigonometric polynomial. It is 
given by 

N 

SnU)(x)= J2 f(n)e 2 ™ xlL . 

n=—N 

Note that by definition, the above sum is symmetric since n ranges from 
—N to N, a choice that is natural because of the resulting decomposition 
of the Fourier series as sine and cosine series. As a consequence, the 
convergence of Fourier series will be understood (in this book) as the 
“limit” as N tends to infinity of these symmetric sums. 

In fact, using the partial sums of the Fourier series, we can reformulate 
the basic question raised in Chapter 1 as follows: 

Problem: In what sense does S^if) converge to / as N — > oo ? 
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Before proceeding further with this question, we turn to some simple 
examples of Fourier series. 

Example 1. Let f(d) = d for — n < 6 < n. The calculation of the Fourier 
coefficients requires a simple integration by parts. First, if n ^ 0, then 


/(«) = 


1 r 

27T 


0e~ ine dd 


6 


e 

vn 


1 

~ 2tt . 

_ (-l ) n+1 
in 

and if n = 0 we clearly have 


—inO 


+ 


i 


—inO 


2nin J- T 


dd 


/(0) = 2 


Hence, the Fourier series of / is given by 


m - £ 

n^O 


{~ l ) n+1 Jne = 2 (_ 1 )"+ 1 sinnd 


m 


n = 1 


n 


The first sum is over all non-zero integers, and the second is obtained by 
an application of Euler’s identities. It is possible to prove by elementary 
means that the above series converges for every 6, but it is not obvious 
that it converges to f{9). This will be proved later (Exercises 8 and 9 
deal with a similar situation). 


Example 2. Define f(0) — (tt — 6) 2 / 4 for 0 < 6 < 2n. Then successive 
integration by parts similar to that performed in the previous example 
yield 


m 


7T 2 ^ cos rid 

~ b / — - 

12 ^ 
n = 1 




Example 3. The Fourier series of the function 


on [0, 27 t] is 


fid) - — 7r — gh 71 — 61 )® 
sin -kol 


m 



ginO 

n + a’ 
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whenever a is not an integer. 


Example 4. The trigonometric polynomial defined for x € [— n, 7r] by 

N 

D n (x)= ]T e inx 

n=—N 


is called the ./V th Dirichlet kernel and is of fundamental importance in 
the theory (as we shall see later). Notice that its Fourier coefficients a n 
have the property that a n — 1 if \n\ < N and a n — 0 otherwise. A closed 
form formula for the Dirichlet kernel is 


D n (x) 


sm((N + \)x) 
sin(x/2) 


This can be seen by summing the geometric progressions 


N 


5 > 

n = 0 


and 


E - 

n=—N 


with lo — e lx . These sums are, respectively, equal to 


1 - uj n+1 


1 — iO 


and 


uj~ n -l 

1 — OJ 


Their sum is then 

lo~ n — uj n+1 — W N+1 / 2 sin((A' + \)x) 

1 — oj tu -1 / 2 — iv 1 / 2 sin(x/2) 


giving the desired result. 


Example 5. The function P r (9), called the Poisson kernel, is defined 
for 0 € [— 7r, 7r] and 0 < r < 1 by the absolutely and uniformly convergent 
series 


Pr(6) 


oo 

r \ n \ e ind . 

n =— oo 


This function arose implicitly in the solution of the steady-state heat 
equation on the unit disc discussed in Chapter 1. Note that in calcu- 
lating the Fourier coefficients of P r (9) we can interchange the order of 
integration and summation since the sum converges uniformly in 8 for 
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each fixed r, and obtain that the ?z th Fourier coefficient equals rl n L One 
can also sum the series for P r (6) and see that 


Pr{9) 


1 - r 2 

1 — 2r cos 9 + r 2 


In fact, 

OO OO 

P r (6) = ^2 L 0 n +^LJ n with ui = re ie , 

71 = 0 71=1 

where both series converge absolutely. The first sum (an infinite geomet- 
ric progression) equals 1/(1 — to), and likewise, the second is u;/(l — oJ). 
Together, they combine to give 

1 — w + (1 — w)uJ 1 — |u;| 2 1 — r 2 

(1 — w)(l — u?) |1 — oj\ 2 1 — 2rcos9 + r 2 ’ 

as claimed. The Poisson kernel will reappear later in the context of Abel 
summability of the Fourier series of a function. 

Let us return to the problem formulated earlier. The definition of 
the Fourier series of / is purely formal, and it is not obvious whether it 
converges to /. In fact, the solution of this problem can be very hard, 
or relatively easy, depending on the sense in which we expect the series 
to converge, or on what additional restrictions we place on /. 

Let us be more precise. Suppose, for the sake of this discussion, that 
the function / (which is always assumed to be Riemann integrable) is 
defined on [— 7r, n\ . The first question one might ask is whether the partial 
sums of the Fourier series of / converge to / pointwise. That is, do we 
have 

(1) lim Sn ( f)(9) — f(0) for every 9 ? 

iV— >oo 

We see quite easily that in general we cannot expect this result to be 
true at every 9 , since we can always change an integrable function at one 
point without changing its Fourier coefficients. As a result, we might 
ask the same question assuming that / is continuous and periodic. For 
a long time it was believed that under these additional assumptions the 
answer would be “yes.” It was a surprise when Du Bois-Reymond showed 
that there exists a continuous function whose Fourier series diverges at 
a point. We will give such an example in the next chapter. Despite this 
negative result, we might ask what happens if we add more smoothness 
conditions on /: for example, we might assume that f is continuously 
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differentiable, or twice continuously differentiable. We will see that then 
the Fourier series of / converges to / uniformly. 

We will also interpret the limit (1) by showing that the Fourier series 
sums, in the sense of Cesaro or Abel, to the function / at all of its points 
of continuity. This approach involves appropriate averages of the partial 
sums of the Fourier series of /. 

Finally, we can also define the limit (1) in the mean square sense. In 
the next chapter, we will show that if / is merely integrable, then 

\S N (f)(9) - m\ 2 do ^ 0 asTV^oo. 

It is of interest to know that the problem of pointwise convergence of 
Fourier series was settled in 1966 by L. Carleson, who showed, among 
other things, that if / is integrable in our sense, 4 then the Fourier series 
of / converges to / except possibly on a set of “measure 0.” The proof 
of this theorem is difficult and beyond the scope of this book. 

2 Uniqueness of Fourier series 

If we were to assume that the Fourier series of functions / converge to / 
in an appropriate sense, then we could infer that a function is uniquely 
determined by its Fourier coefficients. This would lead to the following 
statement: if / and g have the same Fourier coefficients, then / and g 
are necessarily equal. By taking the difference f — g, this proposition 
can be reformulated as: if f(n) = 0 for all n € Z, then / = 0. As stated, 
this assertion cannot be correct without reservation, since calculating 
Fourier coefficients requires integration, and we see that, for example, 
any two functions which differ at finitely many points have the same 
Fourier series. However, we do have the following positive result. 

Theorem 2.1 Suppose that f is an integrable function on the circle with 
f{n ) = 0 for all n £ Z. Then f(9o) = 0 whenever f is continuous at the 
point 9 q. 

Thus, in terms of what we know about the set of discontinuities of in- 
tegrable functions, 5 we can conclude that / vanishes for “most” values 
of 9. 

Proof. We suppose first that / is real-valued, and argue by con- 
tradiction. Assume, without loss of generality, that / is defined on 


4 Carleson’s proof actually holds for the wider class of functions which are square inte- 
grable in the Lebesgue sense. 

5 See the appendix. 
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[— 7r, 7 r], that d 0 = 0, and /( 0) > 0. The idea now is to construct a fam- 
ily of trigonometric polynomials {pk} that “peak” at 0, and so that 
/ Pk{d)f{6) dO — > oo as k — > oo. This will be our desired contradiction 
since these integrals are equal to zero by assumption. 

Since / is continuous at 0, we can choose 0 < 5 < 7t/2, so that f(0) > 
/( 0)/2 whenever |0| < 5. Let 


p{9) = e + cos 6, 


where e > 0 is chosen so small that \p{9)\ < 1 — e/2, whenever 5 < \6\ < 
ir. Then, choose a positive rj with r) < 6, so that p(6) > 1 + e/2, for 
\9\ <rj. Finally, let 


p k (0) = W)] k 


and select B so that \f(9)\ < B for all 6. This is possible since / is 
integrable, hence bounded. Figure 3 illustrates the family {pk}- By 



Figure 3. The functions p, p 6 , and p 15 when e = 0.1 


construction, each pk is a trigonometric polynomial, and since f(n) — 0 
for all n, we must have 



Flowever, we have the estimate 
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Also, our choice of S guarantees that p{9) and f(9) are non-negative 
whenever \9\ < S, thus 


' r l<\S\<S 


f(9) Pk (9) dd> 0. 


Finally, 

/ f(9) Pk (9)d9> 2r ? ffl(l+e/2) fe . 

J\e\< v 2 

Therefore, / p k (9)f{9) dd — > oo as k — > oo, and this concludes the proof 
when / is real- valued. In general, write f(9) = u{9) + iv(9), where u and 
v are real- valued. If we define f(9) = f(9), then 


um= m±M 


and v{9) 


m - m 

2 i 


and since f(n) = f(—n), we conclude that the Fourier coefficients of u 
and v all vanish, hence / = 0 at its points of continuity. The idea 

of constructing a family of functions (trigonometric polynomials in this 
case) which peak at the origin, together with other nice properties, will 
play an important role in this book. Such families of functions will be 
taken up later in Section 4 in connection with the notion of convolution. 
For now, note that the above theorem implies the following. 

Corollary 2.2 If f is continuous on the circle and f(n) = 0 for all 
n € Z, then f = 0. 

The next corollary shows that the problem (1) formulated earlier has a 
simple positive answer under the assumption that the series of Fourier 
coefficients converges absolutely. 

Corollary 2.3 Suppose that f is a continuous function on the circle and 
that the Fourier series of f is absolutely convergent, \f( n )\ < °° • 

Then, the Fourier series converges uniformly to f, that is, 


lim S , at(/)(0) = f(9) uniformly in 9. 

N— >oo 

Proof. Recall that if a sequence of continuous functions converges 
uniformly, then the limit is also continuous. Now observe that the 
assumption |/(n)| < oo implies that the partial sums of the Fourier 
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series of / converge absolutely and uniformly, and therefore the function 
g defined by 


OO 

9(0) = E f(n)e ine 

n =— oo 


N 


lim 

iV— >00 


E /W eine 

n=—N 


is continuous on the circle. Moreover, the Fourier coefficients of g are 
precisely f(n) since we can interchange the infinite sum with the integral 
(a consequence of the uniform convergence of the series). Therefore, the 
previous corollary applied to the function f — g yields f = g, as desired. 

What conditions on / would guarantee the absolute convergence of its 

Fourier series? As it turns out, the smoothness of / is directly related 
to the decay of the Fourier coefficients, and in general, the smoother the 
function, the faster this decay. As a result, we can expect that relatively 
smooth functions equal their Fourier series. This is in fact the case, as 
we now show. 

In order to state the result concisely we introduce the standard “O” 
notation, which we will use freely in the rest of this book. For exam- 
ple, the statement f(n) — 0(l/\n\ 2 ) as \n\ — > oo, means that the left- 
hand side is bounded by a constant multiple of the right-hand side; 
that is, there exists C > 0 with |/(n)| < C/\n\ 2 for all large \n\. More 
generally, f(x) = 0(g(x )) as x — > a means that for some constant C, 
\f(x)\ < C\g(x) | as x approaches a. In particular, f(x) — 0(1) means 
that / is bounded. 


Corollary 2.4 Suppose that f is a twice continuously differentiable func- 
tion on the circle. Then 


f(n ) = 0(l/|n| 2 ) as \n\ — > oo, 


so that the Fourier series of f converges absolutely and uniformly to f. 
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Proof. The estimate on the Fourier coefficients is proved by integrating 
by parts twice for n ^ 0. We obtain 


2t r/(n)= f(9)e- ine dd 

Jo 

27 r 


m 


—e 


— inO 


m 


i r 2n 

= - / /'(% 


0 

— inO 


1 /" 27r 

+ — / f'(d)e- ine 
m Jo 


dQ 


>o 


m 

1 

in 

2 /*27T 


m 


— e 


—inO 


do 

-i 2 ? t 


= — / /"(% 

7T- 2 Vo 


in 

J o 

It ( s\\ „—inQ 


1 /‘ 27r 

+ WW/ f"(d)e~ ine d8 
{m) 2 Jo 


dO. 


The quantities in brackets vanish since / and /' are periodic. Therefore 


27 r|n| 2 |/(u)| < 


r2ir 


f"{6)e~ ine d9 


< 


\f'(9)\de<c, 


where the constant C is independent of n. (We can take C = 2ttB where 
B is a bound for f" .) Since 1/n 2 converges, the proof of the corollary 
is complete. 

Incidentally, we have also established the following important identity: 


f'{n) = inf(n), for all ngZ. 


If n ^ 0 the proof is given above, and if n — 0 it is left as an exercise to the 
reader. So if / is differentiable and / J2a n e ine , then f a n ine ind . 
Also, if / is twice continuously differentiable, then f" ~ Jf, a n(in) 2 e ind , 
and so on. Further smoothness conditions on / imply even better decay 
of the Fourier coefficients (Exercise 10). 

There are also stronger versions of Corollary 2.4. It can be shown, for 
example, that the Fourier series of / converges absolutely, assuming only 
that / has one continuous derivative. Even more generally, the Fourier 
series of / converges absolutely (and hence uniformly to /) if / satisfies 
a Holder condition of order a, with a > 1/2, that is, 

sup | f{9 + t) — f{9) | < A\t\ a for all t. 
e 

For more on these matters, see the exercises at the end of Chapter 3. 
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At this point it is worthwhile to introduce a common notation: we say 
that / belongs to the class C k if f is k times continuously differentiable. 
Belonging to the class C k or satisfying a Holder condition are two possible 
ways to describe the smoothness of a function. 


3 Convolutions 

The notion of convolution of two functions plays a fundamental role in 
Fourier analysis; it appears naturally in the context of Fourier series but 
also serves more generally in the analysis of functions in other settings. 

Given two 27r-periodic integrable functions / and g on R, we define 
their convolution f * g on [— tt, 7r] by 

(2) = f(y)a(x-y)dy. 

The above integral makes sense for each x, since the product of two 
integrable functions is again integrable. Also, since the functions are 
periodic, we can change variables to see that 

1 r 

U*g){x) = —J f(x-y)g(y)dy. 

Loosely speaking, convolutions correspond to “weighted averages.’' For 
instance, if g — 1 in (2), then f * g is constant and equal to ^ f* f(y ) dy, 
which we may interpret as the average value of / on the circle. Also, the 
convolution (/ * g)(x) plays a role similar to, and in some sense replaces, 
the pointwise product f(x)g( x) of the two functions / and g. 

In the context of this chapter, our interest in convolutions originates 
from the fact that the partial sums of the Fourier series of / can be 
expressed as follows: 


N 

s N (f)(x)= y 

n= — N 

= E {y f f(y)e- inv dy) e™ 

n=-N J -* J 

-s/>> 

= (f*D N )(x), 
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where Dn is the ./V th Dirichlet kernel (see Example 4) given by 

N 

D n (x)= Y, einX - 

n=—N 

So we observe that the problem of understanding 5jv(/) reduces to the 
understanding of the convolution f * Djy. 

We begin by gathering some of the main properties of convolutions. 

Proposition 3.1 Suppose that f, g, and h are 2n-periodic integrable 
functions. Then: 


(i) 

f* 

(g + h) 

= {f*g) 

+ ( f*h ). 

(ii) 

( cf ) *g = 

II 

* 

/ * (°g) f° r an v c g c 

(in) 

f* 

* 

II 

/. 


(iv) 

if" 

¥ g) * h 

* 

* 

II 

h). 

(v) 

f * 

g is continuous. 


(vi) 

f* 

g(n) = 

f(n)g(n). 



The first four points describe the algebraic properties of convolutions: 
linearity, commutativity, and associativity. Property (v) exhibits an im- 
portant principle: the convolution of / * g is “more regular” than / or g. 
Here, f * g is continuous while / and g are merely (Riemann) integrable. 
Finally, (vi) is key in the study of Fourier series. In general, the Fourier 
coefficients of the product fg are not the product of the Fourier coeffi- 
cients of / and g. However, (vi) says that this relation holds if we replace 
the product of the two functions / and g by their convolution f * g. 

Proof. Properties (i) and (ii) follow at once from the linearity of the 
integral. 

The other properties are easily deduced if we assume also that / and 
g are continuous. In this case, we may freely interchange the order of 
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integration. For instance, to establish (vi) we write 

/ * g( n ) = 2~^ _/" (/ *g){x)e~ mx dx 

=^Lh{L f{y)six ~ y) dy ) dx 

= i {h /I d, 

= hO ( ^~‘" y (hO^- , ''’ dx ) dy 

= f(n)g(n). 


To prove (iii), one first notes that if F is continuous and 27r-periodic, 
then 

/ 7T /»7T 

F(y ) dy= F(x — y) dy for any x € R. 

-7T J —7T 

The verification of this identity consists of a change of variables y i— > — y, 
followed by a translation y i— > y — x. Then, one takes F{y ) = f(y)g(x — y) 
Also, (iv) follows by interchanging two integral signs, and an appro- 
priate change of variables. 

Finally, we show that if / and g are continuous, then / * g is continu- 
ous. First, we may write 


(/*s)Oi) ~ 2) 


1 

27T 



f(y) [g Oi -y) - g(x 2 - y)\ dy. 


Since g is continuous it must be uniformly continuous on any closed 
and bounded interval. But g is also periodic, so it must be uniformly 
continuous on all of M; given e > 0 there exists 5 > 0 so that | g(s) — 
g(t ) | < e whenever |s — t\ < 5. Then, \x\ — X 2 \ < 5 implies |(xi — y) — 
{%2 ~ y) | <8 for any y, hence 


\{f*g)(xi) ~ (f*g){x 2 )| < — 


/ f(y) [g(x i - y) - g{x 2 - y )] dy 
J — 7T 

- 2tt / 1^)1 \ g ( Xl ~ yS) ~9i.X2 -y)\dy 

J midy 
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where B is chosen so that |/(x)| < B for all x. As a result, we conclude 
that f * g is continuous, and the proposition is proved, at least when / 
and g are continuous. 

In general, when / and g are merely integrable, we may use the re- 
sults established so far (when / and g are continuous), together with 
the following approximation lemma, whose proof may be found in the 
appendix. 

Lemma 3.2 Suppose f is integrable on the circle and bounded by B. 
Then there exists a sequence {fk}kLi of continuous functions on the 
circle so that 


sup \fk(x)\ < B for all k = 1,2, ... , 

X£ [ — 7T,7r] 


and 


/ I/O) ~ fk ( x ) | dx — > 0 as k — > oo. 

J —7V 


Using this result, we may complete the proof of the proposition as 
follows. Apply Lemma 3.2 to / and g to obtain sequences {fk} and {gk} 
of approximating continuous functions. Then 


/ * g - fk * gk = (/ - fk) * g + fk * (g - gk)- 

By the properties of the sequence {/*,}, 

l(/ - fk) * g(x) | < ^ j | f(x - y) - f k (x - y) | \g{y)\ dy 

< 7T su p \g(y)\ f I f(y) - fk(y)\ dy 

Z7T y J —n 

— > 0 as k — > oo. 


Hence {f — fk) * 9 ^ 0 uniformly in x. Similarly, fk* {g ~ 9k) ^ 0 uni- 
formly, and therefore fk * gk tends uniformly to f * g. Since each fk * gk 
is continuous, it follows that / * g is also continuous, and we have (v). 

Next, we establish (vi). For each fixed integer n we must have 
fk * 9k(n) — > / * g{n) as k tends to infinity since fk * gk converges uni- 
formly to f * g. However, we found earlier that fk( n )9k( n ) — fk * 9k ( n ) 
because both fk and gk are continuous. Hence 


\f(n) ~fk(n)\= — 

1 

< — 
“ 2tt 


[ (/O) - fk(x))e inx dx 

J —7T 

I I/O) - fk(x)\dx, 

J — 7T 
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and as a result we find that fk(n) — > f(n ) as k goes to infinity. Similarly 
gk(n) — > g(n), and the desired property is established once we let k tend 
to infinity. Finally, properties (iii) and (iv) follow from the same kind of 
arguments. 


4 Good kernels 


In the proof of Theorem 2.1 we constructed a sequence of trigonometric 
polynomials {p*,} with the property that the functions pk peaked at the 
origin. As a result, we could isolate the behavior of / at the origin. In 
this section, we return to such families of functions, but this time in a 
more general setting. First, we define the notion of good kernel, and 
discuss the characteristic properties of such functions. Then, by the use 
of convolutions, we show how these kernels can be used to recover a given 
function. 


A family of kernels {I\ n (x)}’^L 1 on the circle is said to be a family of 
good kernels if it satisfies the following properties: 


(a) For all n > 1, 


1 r 

— J K n (x)dx = 1. 


(b) There exists M > 0 such that for all n > 1, 

/ \K n (x) \ dx < M. 


(c) For every <5 > 0, 


' |:c| <7T 


\K n (x) \ dx — > 0, asm oo. 


In practice we shall encounter families where K n ( x) > 0, in which 
case (b) is a consequence of (a). We may interpret the kernels K n (x) 
as weight distributions on the circle: property (a) says that K n assigns 
unit mass to the whole circle [— n, 7r], and (c) that this mass concentrates 
near the origin as n becomes large. 6 Figure 4 (a) illustrates the typical 
character of a family of good kernels. 

The importance of good kernels is highlighted by their use in connec- 
tion with convolutions. 


6 In the limit, a family of good kernels represents the “Dirac delta function.” This 
terminology comes from physics. 
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y 


(a) 


(b) 


Figure 4. Good kernels 


Theorem 4.1 Let {K n }ff =1 be a family of good kernels, and f an inte- 
grate function on the circle. Then 


lim (/ * K n )(x) = f{x) 


whenever f is continuous at x. If f is continuous everywhere, then the 
above limit is uniform. 

Because of this result, the family { K n } is sometimes referred to as an 

approximation to the identity. 

We have previously interpreted convolutions as weighted averages. In 
this context, the convolution 



is the average of f(x — y), where the weights are given by K n (y). How- 
ever, the weight distribution K n concentrates its mass at y = 0 as n 
becomes large. Hence in the integral, the value f(x) is assigned the full 
mass as n — > oo. Figure 4 (b) illustrates this point. 

Proof of Theorem f.l. If e > 0 and / is continuous at x, choose <5 so 
that \y\ < 5 implies \ f(x — y) — f{x) \ < e. Then, by the first property of 
good kernels, we can write 
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Hence, 


\(f*K n )(x) - f(x)\ = 


< 


< 


^ j K n {y)[f(x - y) - f(x)\ dy 

I \Kn(y )\ I f(x y) f{x) I dy 
47T J\y\<5 

~r f \ K n{y)\ \f(x - y) - f{x)\dy 

27T Js<\y\<TT 

[ \ K n{y)\dy+ 2 ^- j \K n (y)\dy , 

27T J-n Zt: J 5<\y\<Tr 


where B is a bound for /. The first term is bounded by eM/2n because 
of the second property of good kernels. By the third property we see 
that for all large n, the second term will be less than e. Therefore, for 
some constant C > 0 and all large n we have 


\(f*K n )(x) - f(x ) | < Ce, 


thereby proving the first assertion in the theorem. If / is continuous 
everywhere, then it is uniformly continuous, and S can be chosen in- 
dependent of x. This provides the desired conclusion that / * K n — > / 
uniformly. 

Recall from the beginning of Section 3 that 


S N (f)(x) = (f*D N )(x), 

where Dn(x) = J2n=~N einx the Dirichlet kernel. It is natural now for 
us to ask whether Dn is a good kernel, since if this were true, Theorem 4.1 
would imply that the Fourier series of / converges to f(x) whenever / is 
continuous at x. Unfortunately, this is not the case. Indeed, an estimate 
shows that Dn violates the second property; more precisely, one has (see 
Problem 2) 


| D n (x) | dx > clog AT, as N — > oo. 


However, we should note that the formula for Dn as a sum of exponen- 
tials immediately gives 


1 

27T 



Dn(x) dx 


1 , 


so the first property of good kernels is actually verified. The fact that the 
mean value of Dn is 1, while the integral of its absolute value is large, 
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is a result of cancellations. Indeed, Figure 5 shows that the function 
D n(x) takes on positive and negative values and oscillates very rapidly 
as N gets large. 



Figure 5. The Dirichlet kernel for large N 


This observation suggests that the pointwise convergence of Fourier 
series is intricate, and may even fail at points of continuity. This is 
indeed the case, as we will see in the next chapter. 

5 Cesaro and Abel summability: applications to Fourier 
series 

Since a Fourier series may fail to converge at individual points, we are 
led to try to overcome this failure by interpreting the limit 

Jim S N (f) = f 

iV— >oo 


in a different sense. 

5.1 Cesaro means and summation 

We begin by taking ordinary averages of the partial sums, a technique 
which we now describe in more detail. 
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Suppose we are given a series of complex numbers 

OO 

c 0 + ci + c 2 h — = y; cfc. 

k = 0 

We define the n th partial sum s n by 

n 

Sn — 'y ^ Cfc , 
fc =0 

and say that the series converges to s if linp^oo s n = s. This is the 
most natural and most commonly used type of “summability.” Consider, 
however, the example of the series 

OO 

(3) 1-1 + 1-H E(-l) fe - 

fe =0 

Its partial sums form the sequence {1,0, 1,0,...} which has no limit. 
Because these partial sums alternate evenly between 1 and 0, one might 
therefore suggest that 1/2 is the “limit” of the sequence, and hence 1/2 
equals the “sum” of that particular series. We give a precise meaning to 
this by defining the average of the first N partial sums by 

So + S 1 + ' ' ' + SJV— 1 


The quantity a n is called the N th Cesaro mean 7 of the sequence {sk} 
or the IV th Cesaro sum of the series Y2kL o Cfc - 

If (Tat converges to a limit a as N tends to infinity, we say that the 
series Y2 c n is Cesaro summable to a. In the case of series of functions, 
we shall understand the limit in the sense of either pointwise or uniform 
convergence, depending on the situation. 

The reader will have no difficulty checking that in the above exam- 
ple (3), the series is Cesaro summable to 1/2. Moreover, one can show 
that Cesaro summation is a more inclusive process than convergence. In 
fact, if a series is convergent to s, then it is also Cesaro summable to the 
same limit s (Exercise 12). 

5.2 Fejer’s theorem 

An interesting application of Cesaro summability appears in the context 
of Fourier series. 


7 Note that if the series /_ ( c/, : begins with the term k = 1, then it is common prac- 
tice to define ajy = (si + • • • + sjy)/N. This change of notation has little effect on what 
follows. 
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We mentioned earlier that the Dirichlet kernels fail to belong to the 
family of good kernels. Quite surprisingly, their averages are very well 
behaved functions, in the sense that they do form a family of good ker- 
nels. 

To see this, we form the N th Cesaro mean of the Fourier series, which 
by definition is 


„„(/)(*) = S 0 U)(x) + --- + S N .,{/)(x) 

Since S n (f) — f * D n . we find that 

a-Jv(/)0) = (f*F N )( x), 
where F^(x) is the iV-th Fejer kernel given by 


F n (x) 


Dq(x) j h D N -i(x) 

N 


Lemma 5.1 We have 


F n {x ) 


1 sm 2 (Nx/2) 
N sin 2 (x/2) 


and the Fejer kernel is a good kernel. 

The proof of the formula for Fjy (a simple application of trigonometric 
identities) is outlined in Exercise 15. To prove the rest of the lemma, note 
that Fn is positive and fff F^ (x) dx = 1 , in view of the fact that a 
similar identity holds for the Dirichlet kernels D n . However, sin 2 (x/2) > 
c$ > 0, if 6 < |x| < 7 r, hence F N (x) < l/(Nc$), from which it follows that 


/ |F)v(a;)| dx — > 0 as N — > oo. 

J (5< |x| <7T 

Applying Theorem 4.1 to this new family of good kernels yields the 
following important result. 

Theorem 5.2 If f is integrable on the circle, then the Fourier series of 
f is Cesaro summable to f at every point of continuity of f. 

Moreover, if f is continuous on the circle, then the Fourier series of 
f is uniformly Cesaro summable to f. 

We may now state two corollaries. The first is a result that we have 
already established. The second is new, and of fundamental importance. 
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Corollary 5.3 If f is integrable on the circle and f(n) = 0 for all n, 
then f — 0 at all points of continuity of f. 

The proof is immediate since all the partial sums are 0, hence all the 
Cesaro means are 0. 

Corollary 5.4 Continuous functions on the circle can be uniformly ap- 
proximated by trigonometric polynomials. 

This means that if / is continuous on [ — 7r, 7 t] with /(— rr) — f(ir) and 
e > 0, then there exists a trigonometric polynomial P such that 

| f{x) — P(x) | < e for all — ir < x < n. 

This follows immediately from the theorem since the partial sums, hence 
the Cesaro means, are trigonometric polynomials. Corollary 5.4 is the 
periodic analogue of the Weierstrass approximation theorem for polyno- 
mials which can be found in Exercise 16. 


5.3 Abel means and summation 


Another method of summation was first considered by Abel and actually 
predates the Cesaro method. 

A series of complex numbers Cfe sa ^ to t> e Abel summable 

to s if for every 0 < r < 1, the series 

OO 

A(r ) = c k r k 

k = 0 


converges, and 


lim A(r ) = s. 

r— >1 


The quantities A{r) are called the Abel means of the series. One can 
prove that if the series converges to s, then it is Abel summable to s. 
Moreover, the method of Abel summability is even more powerful than 
the Cesaro method: when the series is Cesaro summable, it is always 
Abel summable to the same sum. However, if we consider the series 


1 — 2 + 3 — 4 + 5 = ^(-l) fe (fc + 1), 

k = o 

then one can show that it is Abel summable to 1 /4 since 

OO -j 

.4 W = £(-!)+ + 1+=-—^, 

k = 0 V ' 

but this series is not Cesaro summable; see Exercise 13. 


Ibookroot October 20, 2007 


5. Cesaro and Abel summability: applications to Fourier series 55 

5.4 The Poisson kernel and Dirichlet’s problem in the unit disc 

To adapt Abel summability to the context of Fourier series, we define 
the Abel means of the function f(9) ~ oo a nP ne by 

OO 

Mfm= E r H a n e ine . 

n =— oo 


Since the index n takes positive and negative values, it is natural to write 
Co = ao, and c„ = a n e md + o_ n e _m 6 for n > 0, so that the Abel means 
of the Fourier series correspond to the definition given in the previous 
section for numerical series. 

We note that since / is integrable, \a n \ is uniformly bounded in n, so 
that A r (f) converges absolutely and uniformly for each 0 < r < 1. Just 
as in the case of Cesaro means, the key fact is that these Abel means can 
be written as convolutions 


where P r (0) is the Poisson kernel given by 

OO 

(4) P r [9) = r |n| e* n0 . 

n =— oo 

In fact, 

OO 

MfW)= E ^a n e in6 

f{<p)e~ in * d^j e ine 

= lf n m ( E c bp, 

' 71 \n =— oo / 

where the interchange of the integral and infinite sum is justified by the 
uniform convergence of the series. 

Lemma 5.5 IfO < r < 1, then 



P r {0) = 


1 — 2 r cos 0 + r 2 
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The Poisson kernel is a good kernel , 8 as r tends to 1 from below. 

, _ 2 

Proof. The identity P r (9) — 1 _.- >r C( f s6+r 2 has already been derived in 
Section 1.1. Note that 

1 — 2rcos6 + r 2 = (1 — r) 2 + 2r(l — cos 6). 

Hence if 1/2 < r < 1 and 5 < \6\ < n, then 

1 — 2 r cos 0 + r 2 > c$ > 0. 

Thus P r {0) < (1 — r 2 )/cs when d < |0| < n, and the third property of 
good kernels is verified. Clearly P r {6) > 0, and integrating the expres- 
sion (4) term by term (which is justified by the absolute convergence of 
the series) yields 

^ f P r {6) d6 = 1, 

thereby concluding the proof that P r is a good kernel. 

Combining this lemma with Theorem 4.1, we obtain our next result. 

Theorem 5.6 The Fourier series of an integrable function on the circle 
is Abel summable to f at every point of continuity. Moreover, if f is 
continuous on the circle, then the Fourier series of f is uniformly Abel 
summable to f. 

We now return to a problem discussed in Chapter 1 , where we sketched 
the solution of the steady-state heat equation A u = 0 in the unit disc 
with boundary condition u = f on the circle. We expressed the Laplacian 
in terms of polar coordinates, separated variables, and expected that a 
solution was given by 

OO 

(5) u(r,0)= 'e imd , 

m=—oo 

where a m was the m th Fourier coefficient of /. In other words, we were 
led to take 

u(r, 6) = A r (f){6) = ^ j f(cp)P r (0 - ip) dp. 

We are now in a position to show that this is indeed the case. 


8 In this case, the family of kernels is indexed by a continuous parameter 0 < r < 1, 
rather than the discrete n considered previously. In the definition of good kernels, we 
simply replace n by r and take the limit in property (c) appropriately, for example r — > 1 
in this case. 
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Theorem 5.7 Let f be an integrable function defined on the unit circle. 
Then the function u defined in the unit disc by the Poisson integral 

(6) u(r,9) = {f * P r ){9) 

has the following properties: 

(i) u has two continuous derivatives in the unit disc and satisfies 
A u = 0. 

(ii) If 9 is any point of continuity of f, then 

lim u(r, 6) = f{9). 

r — >1 

If f is continuous everywhere, then this limit is uniform. 

(iii) If f is continuous, then u(r , 9) is the unique solution to the steady- 
state heat equation in the disc which satisfies conditions (i) and (ii). 

Proof. For (i), we recall that the function u is given by the series (5). 
Fix p < 1; inside each disc of radius r < p < 1 centered at the origin, the 
series for u can be differentiated term by term, and the differentiated se- 
ries is uniformly and absolutely convergent. Thus u can be differentiated 
twice (in fact infinitely many times), and since this holds for all p < 1, 
we conclude that u is twice differentiable inside the unit disc. Moreover, 
in polar coordinates, 


d 2 u 1 du 1 d 2 u 
dr 2 r dr r 2 d9 2 ’ 

so term by term differentiation shows that Au = 0. 

The proof of (ii) is a simple application of the previous theorem. To 
prove (iii) we argue as follows. Suppose v solves the steady-state heat 
equation in the disc and converges to / uniformly as r tends to 1 from 
below. For each fixed r with 0 < r < 1, the function v(r, 9) has a Fourier 
series 


OO 

a n (r)e ind where a n (r) 

n=—oc 


f v(r, 9)e~ in6 d9. 

J — 7i- 


Taking into account that v(r, 9) solves the equation 
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we find that 

1 n 2 

(8) o"(r) + - a' n (r ) - — a n {r ) = 0. 

Indeed, we may first multiply (7) by e _m0 and integrate in 6. Then, 
since v is periodic, two integrations by parts give 

Finally, we may interchange the order of differentiation and integra- 
tion, which is permissible since v has two continuous derivatives; this 
yields (8). 

Therefore, we must have a n (r) — A n r n + B n r~ n for some constants 
A n and B n . when 0 (see Exercise 11 in Chapter 1). To evaluate the 
constants, we first observe that each term a n (r) is bounded because v is 
bounded, therefore B n = 0. To find A n we let r — > 1. Since v converges 
uniformly to / as r — > 1 we find that 

An= ^f f^ e ~ in(>dd - 

By a similar argument, this formula also holds when n = 0. Our con- 
clusion is that for each 0 < r < 1, the Fourier series of v is given by the 
series of u(r,9), so by the uniqueness of Fourier series for continuous 
functions, we must have u — v. 

Remark. By part (iii) of the theorem, we may conclude that if u 
solves Ait = 0 in the disc, and converges to 0 uniformly as r — > 1, then 
u must be identically 0. However, if uniform convergence is replaced by 
pointwise convergence, this conclusion may fail; see Exercise 18. 

6 Exercises 

1. Suppose / is 27r-periodic and integrable on any finite interval. Prove that if 
o, b G R, then 


pb /»6+27r nb— 2 tt 

/ f(x) dx= f(x) dx= f(x) dx 

J a J a-\-27T J a—2 , K 


Also prove that 


/ 7T />7 T + a 

f[x) dx= f(x) dx. 

-7T J — 7T + a 
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2. In this exercise we show how the symmetries of a function imply certain 
properties of its Fourier coefficients. Let / be a 27r-periodic Riemann integrable 
function defined on R. 

(a) Show that the Fourier series of the function / can be written as 

f(6) ~ /( 0) + Yifjn) + /(— n)] cos nQ + i[f(n) - f(-n )] sin nd. 

n> 1 

(b) Prove that if / is even, then /(n) = /(— n), and we get a cosine series. 

(c) Prove that if / is odd, then /(n) = — /(— n), and we get a sine series. 

(d) Suppose that f(0 + 71) = f(6) for all 9 6 R. Show that /(n) = 0 for all 
odd n. 

(e) Show that / is real-valued if and only if /(n) = /(— n) for all n. 


3. We return to the problem of the plucked string discussed in Chapter 1. Show 
that the initial condition / is equal to its Fourier sine series 


F , , v'' , .,,4 2/i sin mp 

J(x )= } A m sin mx with A m = r. 

' m A pin — p) 

771=1 


[Hint: Note that \Am\ < C/m 2 .] 


4. Consider the 27r-periodic odd function defined on [0, 7r] by f(9) = 9( ir — 9). 

(a) Draw the graph of /. 

(b) Compute the Fourier coefficients of /, and show that 


m 



k odd > 1 


sin k6 
k 3 


5. On the interval [— n, n] consider the function 

m = 


0 if \e\ > 5, 

1 — \0\/5 if |0| < 5. 


Thus the graph of / has the shape of a triangular tent. Show that 


.... 5 \ ^ 1 — cos n5 

f(0) = - — x — - — cos n6. 

271 L ' 

71=1 


n 2 Tr5 


6. Let / be the function defined on [-71, tt] by f(0) = \9 
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(a) 

(b) 


Draw the graph of /. 

Calculate the Fourier coefficients of /, and show that 

( - if n = 0, 


/V) 


-! + (-!)" 
7rn 2 


if n ^ 0. 


(c) What is the Fourier series of / in terms of sines and cosines? 

(d) Taking 6 = 0 , prove that 



n odd >1 


and 



n= 1 


See also Example 2 in Section 1.1. 

7. Suppose {a n }^ = i and {b n }^ =1 are two finite sequences of complex numbers. 
Let Bk = TV, b n denote the partial sums of the series ^ b n with the convention 
/>’■: 0 . 

(a) Prove the summation by parts formula 

N N - 1 

^ ' O'nbn — & M B M —1 ^ ' (^n+1 ®n)7?n- 

n=M n=M 

(b) Deduce from this formula Dirichlet’s test for convergence of a series: if the 
partial sums of the series ^ b n are bounded, and {a„} is a sequence of 
real numbers that decreases monotonically to 0, then ^ a n b n converges. 


ginx 

8. Verify that — is Fourier series of the 27r-periodic sawtooth 

function illustrated in Figure 6, dehned by /( 0) = 0, and 

b ~ n < x < 0, 
if0<z<7r. 

Note that this function is not continuous. Show that nevertheless, the series 
converges for every x (by which we mean, as usual, that the symmetric partial 
sums of the series converge). In particular, the value of the series at the origin, 
namely 0, is the average of the values of /( x) as x approaches the origin from 
the left and the right. 
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Figure 6. The sawtooth function 


[Hint: Use Dirichlet’s test for convergence of a series ^ a n b n .} 


9. Let f{x) =X[ a ,t](a;) be the characteristic function of the interval [a,b\ C 
[ — 7r, 7r] , that is, 


X[a,b]( X ) 


1 if x G [a, 6], 
0 otherwise. 


(a) Show that the Fourier series of / is given by 


f(x) 


b — a 
2tt 


+ E 


g-ina g — inb 

2n in 


g inx 


The sum extends over all positive and negative integers excluding 0. 

(b) Show that if a / — 7r or b ^ tv and a / b, then the Fourier series does not 
converge absolutely for any x. [Hint: It suffices to prove that for many 
values of n one has | sinn0 o | > c > 0 where 9 0 = (b — a)/2.[ 

(c) However, prove that the Fourier series converges at every point x. What 
happens if a = — n and b = 7r? 


10 . Suppose / is a periodic function of period 2-7t which belongs to the class C k . 
Show that 

/(n) = 0(l/|n| fe ) as |n| — > oo. 

This notation means that there exists a constant C such |/(n)| < C /\n\ k . We 
could also write this as \n\ k f(n) = 0(1), where 0(1) means bounded. 

[Hint: Integrate by parts.] 

11. Suppose that {fk}kLi is a sequence of Riemann integrable functions on the 
interval [0, 1] such that 

fk{x) — f{pc) | dx — > 0 as k — > oo. 
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Show that /fc(n) — > f(n) uniformly in n as k — > oc. 

12. Prove that if a series of complex numbers 22 °n converges to s, then 22 c n 
is Cesaro summable to s. 

[Hint: Assume s n — > 0 as n — > oo.] 

13. The purpose of this exercise is to prove that Abel summability is stronger 
than the standard or Cesaro methods of summation. 

(a) Show that if the series 22n=i Cn °f complex numbers converges to a finite 
limit s, then the series is Abel summable to s. [Hint: Why is it enough to 
prove the theorem when s = 0? Assuming s = 0, show that if Sn = Ci + 

1- c N , then J2n = i c n r " = I 1 “ r ) X^=i s ^ rU + s nc n+1 ■ Let N -> oo 

to show that 

y, C n r n = (1 - r) y s n r n . 

Finally, prove that the right-hand side converges to 0 as r — > 1.] 

(b) However, show that there exist series which are Abel summable, but that 
do not converge. [Hint: Try c n = (— l) n . What is the Abel limit of 22 c rJ] 

(c) Argue similarly to prove that if a series 222=i Cn Cesaro summable to 
a, then it is Abel summable to a. [Hint: Note that 

OO OO 

y C n r n = (1 - r) 2 y ncr n r n , 

71=1 71=1 

and assume a = 0.] 

(d) Give an example of a series that is Abel summable but not Cesaro summable. 
[Hint: Try c n = (— l) n-1 n. Note that if 22 c n is Cesaro summable, then 
Cn/n tends to 0.] 

The results above can be summarized by the following implications about 
series: 


convergent => Cesaro summable => Abel summable, 
and the fact that none of the arrows can be reversed. 

14. This exercise deals with a theorem of Tauber which says that under an 
additional condition on the coefficients c n , the above arrows can be reversed. 

(a) If 22 c n is Cesaro summable to a and c n = o(l/n) (that is, nc n — » 0), then 
22 c n converges to a. [Hint: s n — a n = [(n — l)c n + • • • + C 2 ]/n.] 

(b) The above statement holds if we replace Cesaro summable by Abel summable. 
[Hint: Estimate the difference between XX=i Cn anc ^ E!,=i c nC n where 

r = 1 - l/N] 
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15. Prove that the Fejer kernel is given by 


Fn(x) 


1 sin 2 (Nx/2) 
N sin 2 (*/2) 


[Hint: Remember that NFn(x) = D 0 (x) + • • • + D^-i{x) where D n (x) is the 
Dirichlet kernel. Therefore, if oj = e lx we have 


NF n {x) = 


n = 0 


1 — C 0 


16. The Weierstrass approximation theorem states: Let / be a continuous 
function on the closed and bounded interval [a, b] C R. Then, for any e > 0, 
there exists a polynomial P such that 

sup \f(x) — P(x) | < e. 

x£ [a, 6] 

Prove this by applying Corollary 5.4 of Fejer’s theorem and using the fact that 
the exponential function e lx can be approximated by polynomials uniformly on 
any interval. 

17. In Section 5.4 we proved that the Abel means of / converge to / at all 
points of continuity, that is, 

lim A r (f)(0) = lim(P r * /)($) = f(Q), with 0 < r < 1, 

r — >-1 r — > 1 

whenever / is continuous at 9. In this exercise, we will study the behavior of 
A r (f)(9) at certain points of discontinuity. 

An integrable function is said to have a jump discontinuity at 9 if the two 
limits 


lim f(9 + h ) = f{9 + ) and 

h — ► 0 
h > 0 


lim f(9 — h) = f(9~) 

h — *■ 0 
h > 0 


exist. 

(a) Prove that if / has a jump discontinuity at 0 , then 

lim Ar(f){6) = \ W ith 0 < r < 1. 

r— ► 1 2 

[Hint: Explain why -F P r (9) d9 = -F f* P r {9) d9 = |, then modify 
the proof given in the text.] 
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(b) Using a similar argument, show that if / has a jump discontinuity at 9, 
the Fourier series of / at 9 is Cesaro summable to ZilhJiiJJLJ. 


18. If P r (9) denotes the Poisson kernel, show that the function 


u(r, 9) 


dP r 

~d0' 


defined for 0 < r < 1 and 9 £ R, satisfies: 

(i) Am = 0 in the disc. 

(ii) lim,.-,! u(r , 9) = 0 for each 9. 
However, u is not identically zero. 


19. Solve Laplace’s equation Am = 0 in the semi infinite strip 

S = {(: v,y) : 0 < x < 1, 0 < y}, 

subject to the following boundary conditions 

{ m(0, y) = 0 when 0 <y, 
m(1, y) = 0 when 0 < y, 
u(x, 0) = f(x) when 0 < x < 1 

where / is a given function, with of course /( 0) = /( 1) = 0. Write 

OO 

/(*) = E a n sin(?7.7r:r) 

n = 1 

and expand the general solution in terms of the special solutions given by 

u n (x, y ) = e _n7r!/ sin(n7ra;). 

Express u as an integral involving /, analogous to the Poisson integral for- 
mula (6). 

20. Consider the Dirichlet problem in the annulus defined by {(?', 9) : p < r < 1}, 
where 0 < p < 1 is the inner radius. The problem is to solve 

d 2 u Idu 1 d 2 u _ 

dr 2 r dr r 2 d9 2 

subject to the boundary conditions 


u(i,9) = m, 

u(p,9) = g{9 ), 
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where / and g are given continuous functions. 

Arguing as we have previously for the Dirichlet problem in the disc, we can 
hope to write 

u(r,9) = y ^c n (r)e m0 
with c n (r) = A n r n + B n r~ n , n/0. Set 

m ~ a n e m9 and g(9) ~ ^ 

We want c n (l) = a„ and c n (p ) = b n . This leads to the solution 

« (r, 0) = Y,( 0 n \-n) W/ r T - ( r /pD On + (r n - r~ n )b n ] e- e 

+a 0 + (b 0 - a 0 )p^. 

logp 

Show that as a result we have 

u(r, 9) — ( P r * /)(#) ~ > 0 as r — > 1 uniformly in 9, 

and 

u(r, 9) — ( Pp/ r * g)(0) — > 0 as r — * p uniformly in 9. 


7 Problems 

1. One can construct Riemann integrable functions on [0, 1] that have a dense 
set of discontinuities as follows. 

(a) Let /( x) = 0 when x < 0, and f(x) = 1 if x > 0. Choose a countable dense 
sequence {r„} in [0, 1], Then, show that the function 

oo 

F ( x ) = ~ 
n=l 

is integrable and has discontinuities at all points of the sequence {r n }. 
[Hint: F is monotonic and bounded.] 

(b) Consider next 

OO 

F{x ) = ~ n g(x - r n ), 

n= 1 

where g(x) = sinl/cc when x ^ 0, and g( 0) = 0. Then F is integrable, 
discontinuous at each x = r n , and fails to be monotonic in any subinterval 
of [0, 1]. [Hint: Use the fact that 3 -fc > ’Yh n>k 3 - ".] 
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(c) The original example of Riemann is the function 

*w-E^. 

71=1 

where (x) = x for x € (—1/2, 1/2] and ( x ) is continued to R by periodicity, 
that is, (x + 1) = (x). It can be shown that F is discontinuous whenever 
x = m/2n , where m,n£Z with m odd and n ^ 0. 


2 . Let Du denote the Dirichlet kernel 


p rp\ r ik6 sin((iV + 1/2)0) 

D " w A, e * • 


and define 


Iiv — 


2tt 


|Djv(0)|d0. 


(a) Prove that 

Ljv > c log N 

for some constant c > 0. [Hint: Show that |Z?jv(^)| > c sm((iv+i/2)9) ^ g^g^gg 
variables, and prove that 

/ N 7T | . nl 

^j^d<9 + 0(l). 

Write the integral as a sum ^2^=i fkn +1 ^' conclude, use the fact that 
y//_i l/k > clogn.] A more careful estimate gives 

L n = log AT + 0(1). 

7T Z 

(b) Prove the following as a consequence: for each n > 1, there exists a contin- 
uous function f n such that |/ ra | < 1 and |5 n (/„)(0)| > c'logn. [Hint: The 
function g n which is equal to 1 when D n is positive and —1 when D n is 
negative has the desired property but is not continuous. Approximate g n 
in the integral norm (in the sense of Lemma 3.2) by continuous functions 
/ifc satisfying \hk\ < 1.] 


3 .* Littlewood provided a refinement of Tauber’s theorem: 
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(a) If c n is Abel summable to s and c n = 0(l/n), then c n converges to 
s. 

(b) As a consequence, if ^ c n is Cesaro summable to s and c n = 0(l/n), then 

c n converges to s. 

These results may be applied to Fourier series. By Exercise 17, they imply that 
if / is an integrable function that satisfies f(v) = 0( l/|z'|), then: 

(i) If / is continuous at 9, then 

S N (fm m as N -» 00 . 


(ii) If / has a jump discontinuity at 9, then 


s N (fm 


f(9+)+f(9~) 

2 


as N — > oo. 


(iii) If / is continuous on [ — 7r, 7r] , then Sn(f) — ► / uniformly. 

For the simpler assertion (b), hence a proof of (i), (ii), and (iii), see Problem 5 
in Chapter 4. 
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Convergence of Fourier Series 


The sine and cosine series, by which one can repre- 
sent an arbitrary function in a given interval, enjoy 
among other remarkable properties that of being con- 
vergent. This property did not escape the great ge- 
ometer (Fourier) who began, through the introduc- 
tion of the representation of functions just mentioned, 
a new career for the applications of analysis; it was 
stated in the Memoir which contains his first research 
on heat. But no one so far, to my knowledge, gave a 
general proof of it . . . 


G. Dirichlet, 1829 


In this chapter, we continue our study of the problem of convergence 
of Fourier series. We approach the problem from two different points of 
view. 

The first is “global” and concerns the overall behavior of a function 
/ over the entire interval [0, 27 t] . The result we have in mind is “mean- 
square convergence” : if / is integrable on the circle, then 



as N 


oo. 


At the heart of this result is the fundamental notion of “orthogonal- 
ity” ; this idea is expressed in terms of vector spaces with inner products, 
and their related infinite dimensional variants, the Hilbert spaces. A 
connected result is the Parseval identity which equates the mean-square 
“norm” of the function with a corresponding norm of its Fourier coeffi- 
cients. Orthogonality is a fundamental mathematical notion which has 
many applications in analysis. 

The second viewpoint is “local” and concerns the behavior of / near a 
given point. The main question we consider is the problem of pointwise 
convergence: does the Fourier series of / converge to the value f{9) 
for a given 9 ? We first show that this convergence does indeed hold 
whenever / is differentiable at 9. As a corollary, we obtain the Riemann 
localization principle, which states that the question of whether or not 
Sn{I){9) — > f(0) is completely determined by the behavior of / in an 
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arbitrarily small interval about 6. This is a remarkable result since the 
Fourier coefficients, hence the Fourier series, of / depend on the values 
of / on the whole interval [0, 27 t], 

Even though convergence of the Fourier series holds at points where 
/ is differentiable, it may fail if / is merely continuous. The chapter 
concludes with the presentation of a continuous function whose Fourier 
series does not converge at a given point, as promised earlier. 

1 Mean-square convergence of Fourier series 

The aim of this section is the proof of the following theorem. 

Theorem 1.1 Suppose f is integrable on the circle. Then 



as N 


oo. 


As we remarked earlier, the key concept involved is that of orthogonal- 
ity. The correct setting for orthogonality is in a vector space equipped 
with an inner product. 

1.1 Vector spaces and inner products 

We now review the definitions of a vector space over IR or C, an inner 
product, and its associated norm. In addition to the familiar finite- 
dimensional vector spaces M. d and C d , we also examine two infinite- 
dimensional examples which play a central role in the proof of Theo- 
rem 1.1. 

Preliminaries on vector spaces 

A vector space V over the real numbers IR is a set whose elements may be 
“added” together, and “multiplied” by scalars. More precisely, we may 
associate to any pair 1,7 G V an element in V called their sum and 
denoted by X + Y. We require that this addition respects the usual laws 
of arithmetic, such as commutativity X + Y — Y + X, and associativity 
X + (Y + Z) — {X + Y) + Z, etc. Also, given any X £ V and real num- 
ber A, we assign an element AX £ V called the product of X by A. This 
scalar multiplication must satisfy the standard properties, for instance 
Ai(A 2 X) = (AiA 2 )X and A (X + Y) = AX + AY. We may instead allow 
scalar multiplication by numbers in C; we then say that V is a vector 
space over the complex numbers. 
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For example, the set of d-tuples of real numbers (x\,X 2 , • • ■ , Xd) is 
a vector space over the reals. Addition is defined componentwise by 


(xi , . . . ,x d ) + (yi, . . . ,y d ) = (x! +y 1: . . . ,x d + y d ) 


and so is multiplication by a scalar A £ M: 


A(xi, ...,Xd) — (Aaq, A Xd). 


Similarly, the space C d (the complex version of the previous example) 
is the set of d-tuples of complex numbers (zi,Z 2 , ■ ■ . , z d )- It is a vector 
space over C with addition defined componentwise by 

(zi, ...,z d ) + {wi, . . ,,w d ) = (zi + wi, . . . ,Zd + w d ). 

Multiplication by scalars A £ C is given by 


X(zi,...,Zd) = (\zi,...,\zd). 


An inner product on a vector space V over M associates to any pair 
X,Y of elements in V a real number which we denote by (X,Y). In 
particular, the inner product must be symmetric (X, Y) = (Y, X) and 
linear in both variables; that is, 


(aX + 1 3Y , Z) - a(X, Z) + f3(Y, Z) 


whenever a, /3 £ M and X, Y, Z £ V. Also, we require that the inner prod- 
uct be positive-definite, that is, (X, X) > 0 for all X in V. In particular, 
given an inner product ( ■ , ■ ) we may define the norm of X by 


X|| = (X^) 1 / 2 . 


If in addition ||X|[ =0 implies X = 0, we say that the inner product is 
strictly positive-definite. 


For example, the space M. d is equipped with a (strictly positive-definite) 
inner product defined by 


(X, 1') = xiyi H b x d y d 


when X = (xi, . . . , x d ) and Y = (jq, . . . , y d ). Then 
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which is the usual Euclidean distance. One also uses the notation X 
instead of ||X||. 

For vector spaces over the complex numbers, the inner product of two 
elements is a complex number. Moreover, these inner products are called 
Hermitian (instead of symmetric) since they must satisfy 
(X, Y) = (Y,X). Hence the inner product is linear in the first variable, 
but conjugate-linear in the second: 


(aX + /3Y, Z ) = a(X, Z) + 0(Y, Z) and 

( X , aY + (3Z) = a(X, Y) + 0(X, Z). 

Also, we must have (X, X) > 0, and the norm of X is defined by 
||X || = (X, X) 1 / 2 as before. Again, the inner product is strictly positive- 
definite if ||X || = 0 implies X = 0. 

For example, the inner product of two vectors Z — (zi, z d ) and 
W = {w i, . . . , Wd) in C d is defined by 


(Z, W) = ziwi H h z d w d . 


The norm of the vector Z is then given by 

\\z\\ - (Z, Z ) 1 / 2 = ^M 2 + --- + M 2 . 

The presence of an inner product on a vector space allows one to define 
the geometric notion of “orthogonality.” Let V be a vector space (over R 
or C) with inner product (■, ■) and associated norm || ■ ||. Two elements 
X and Y are orthogonal if (X, Y) = 0, and we write X _L Y. Three 
important results can be derived from this notion of orthogonality: 

(i) The Pythagorean theorem: if X and Y are orthogonal, then 

||x + y || 2 = ||x || 2 + ||y'|| 2 . 

(ii) The Cauchy-Schwarz inequality: for any X, Y € V we have 

|(X,1')| < ||X|| ||F||. 

(iii) The triangle inequality: for any X,Y £ V we have 


Ibookroot October 20, 2007 


1. Mean-square convergence of Fourier series 73 

The proofs of these facts are simple. For (i) it suffices to expand 
( X + Y, X + Y ) and use the assumption that (X, Y) = 0. 

For (ii), we first dispose of the case when ||Y|| =0 by showing that 
this implies (X, Y) — 0 for all X. Indeed, for all real t we have 

0< ||X + tY|| 2 = ||X|| 2 + 2fRe(X,Y) 

and Re(X, Y) ^ 0 contradicts the inequality if we take t to be large and 
positive (or negative). Similarly, by considering \\X + itY|| 2 , we find that 
Im(X, Y) = 0. 

If ||Y|| 0, we may set c — (X, Y)/(Y. Y); then X — cY is orthogonal 

to Y, and therefore also to cY. If we write X = X — cY + cY and apply 
the Pythagorean theorem, we get 

||X|| 2 = || X - cY || 2 + ||cY|| 2 > |c| 2 ||Y|| 2 . 

Taking square roots on both sides gives the result. Note that we have 
equality in the above precisely when X = cY . 

Finally, for (iii) we first note that 

||X + Y|| 2 = (X, X) + (X, Y) + (Y, X) + (Y, Y). 

But (X, X) = 1 1 X 1 1 2 , (Y, Y) = || Y|| 2 , and by the Cauchy-Schwarz inequal- 
ity 

I (X, Y) + (Y, X)| < 2 ||X || 1 1 Y 1 1 , 


therefore 

||X + Y || 2 < ||X|| 2 + 2 ||X|| 1 1 Y |:| + || Y|| 2 = (||X|| + || Y||) 2 . 

Two important examples 

The vector spaces M. d and C d are finite dimensional. In the context 
of Fourier series, we need to work with two infinite-dimensional vector 
spaces, which we now describe. 

Example 1. The vector space f? 2 (Z) over C is the set of all (two-sided) 
infinite sequences of complex numbers 

(. . . , U_ n , . . . , U—i, U0; ■ j Q"ni • • •) 


\a n \ 2 < oc; 

n£Z 


such that 
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that is, the series converges. Addition is defined componentwise, and 
so is scalar multiplication. The inner product between the two vectors 
A = (. . . , a_i, ao, ai, . . .) and B = (. . . , b-i,bo, bi, . . .) is defined by the 
absolutely convergent series 

(A,B) = ^2 a nK ■ 

n£Z 


The norm of A is then given by 


\\A\\ = {A,A)^= (EM 2 ) ' 

\n£Z / 

We must first check that £ 2 (Z) is a vector space. This requires that if 
A and B are two elements in t? 2 (Z), then so is the vector A + B. To see 
this, for each integer N > 0 we let A N denote the truncated element 

An = (•••, 0, 0, a-N, ■ ■ • , «-i, a 0 , oi, . . . , ajv, 0, 0, . . .), 

where we have set a n — 0 whenever |n| > N. We define the truncated 
element Bn similarly. Then, by the triangle inequality which holds in a 
finite dimensional Euclidean space, we have 

||Ajv + -Bjvll < \\A N \\ + \\B n \\ < ||A|| + ||B||. 

Thus 

I a n + b n | 2 < (||A|| + ||i?||) 2 , 

\n\< N 

and letting N tend to infinity gives J2 n ez \ a n + ^n\ 2 < oo. It also fol- 
lows that || A + B|| < ||A|| + ||B|| , which is the triangle inequality. The 
Cauchy-Schwarz inequality, which states that the sum J2nez a nbn con- 
verges absolutely and that |(A, B)\ < ||A|| ||i?||, can be deduced in the 
same way from its finite analogue. 

In the three examples IR d , C d , and ^ 2 (Z), the vector spaces with their 
inner products and norms satisfy two important properties: 

(i) The inner product is strictly positive-definite, that is, ||A|| = 0 
implies X — 0. 

(ii) The vector space is complete, which by definition means that 
every Cauchy sequence in the norm converges to a limit in the 
vector space. 
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An inner product space with these two properties is called a Hilbert 
space. We see that IR d and C d are examples of finite-dimensional Hilbert 
spaces, while l 2 (T,) is an example of an infinite-dimensional Hilbert space 
(see Exercises 1 and 2). If either of the conditions above fail, the space 
is called a pre-Hilbert space. 

We now give an important example of a pre-Hilbert space where both 
conditions (i) and (ii) fail. 

Example 2. Let TZ denote the set of complex- valued Riemann integrable 
functions on [0,27 t] (or equivalently, integrable functions on the circle). 
This is a vector space over C. Addition is defined pointwise by 

(f + gm = m + g(e). 

Naturally, multiplication by a scalar A € C is given by 

(A/)(0) = A-/(0). 

An inner product is defined on this vector space by 

(i) Uig) = ^fJf(.o)W)de. 

The norm of / is then 



One needs to check that the analogue of the Cauchy- Schwarz and tri- 
angle inequalities hold in this example; that is, \(f,g)\ < ||/|| ||g|| and 
||/ + g || < || /|| + || 5 1| . While these facts can be obtained as consequences 
of the corresponding inequalities in the previous examples, the argument 
is a little elaborate and we prefer to proceed differently. 

We first observe that 2 AB < (A 2 + B 2 ) for any two real numbers A 
and B. If we set A = A 1 / 2 |/(0)| and B = \~ l / 2 \g(9)\ with A > 0, we get 

\f(6)W)\<\wfm 2 + 

We then integrate this in 6 to obtain 

\(f,g)\ < ^ J** \m\ \W)\ do < |(A||/H 2 + A- 1 !!#). 

Then, put A = 11511/11/11 to get the Cauchy-Schwarz inequality. The tri- 
angle inequality is then a simple consequence, as we have seen above. 
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Of course, in our choice of A we must assume that ||/|| ^ 0 and ||g|| ^ 0, 
which leads us to the following observation. 

In 7 Z, condition (i) for a Hilbert space fails, since ||/|| =0 implies only 
that / vanishes at its points of continuity. This is not a very serious 
problem since in the appendix we show that an integrable function is 
continuous except for a “negligible” set, so that ||/|| =0 implies that / 
vanishes except on a set of “measure zero.” One can get around the 
difficulty that / is not identically zero by adopting the convention that 
such functions are actually the zero function, since for the purpose of 
integration, / behaves precisely like the zero function. 

A more essential difficulty is that the space 7 Z is not complete. One 
way to see this is to start with the function 



0 for 9 = 0, 

log(l/0) for 0 < 9 < 2n. 


Since / is not bounded, it does not belong to the space TZ. Moreover, 
the sequence of truncations f n defined by 


0 for 0 < 6 < l/n, 
f(6) for l/n < 8 <2n 


fn{8) 


can easily be seen to form a Cauchy sequence in TZ (see Exercise 5). How- 
ever, this sequence cannot converge to an element in TZ, since that limit, 
if it existed, would have to be /; for another example, see Exercise 7. 

This and more complicated examples motivate the search for the com- 
pletion of TZ, the class of Riemann integrable functions on [0, 2 tt]. The 
construction and identification of this completion, the Lebesgue class 
L 2 ([0, 27 t]), represents an important turning point in the development of 
analysis (somewhat akin to the much earlier completion of the rationals, 
that is, the passage from Q to K). A further discussion of these fun- 
damental ideas will be postponed until Book III, where we take up the 
Lebesgue theory of integration. 

We now turn to the proof of Theorem 1.1. 

1.2 Proof of mean-square convergence 

Consider the space TZ of integrable functions on the circle with inner 
product 
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and norm ||/|| defined by 

1 r 27V 

ii/ii 2 = (/,/) = ^/ o \m\ 2 do. 


With this notation, we must prove that \\f — Sjv(/)|| — > 0 as N tends to 
infinity. 

For each integer n, let e n (d) — e me , and observe that the family {e n } neZ 

is orthonormal; that is, 


(®n> 


^m) 


1 if n = to 
0 if n / to. 


Let / be an integrable function on the circle, and let a n denote its Fourier 
coefficients. An important observation is that these Fourier coefficients 
are represented by inner products of / with the elements in the orthonor- 
mal set {e n } n€Z : 


(/,e„) = Er C f^ e ~ in(> d6 = 

In particular, 5V(/) = F|«|<jv a n e n- Then the orthonormal property of 
the family {e n } and the fact that a n — (/, e n ) imply that the difference 
/ — E|n|<w a n&n is orthogonal to e n for all \n\ < N. Therefore, we must 
have 


(2) (/ 'y \ Q>n&n) T 'y ' b n e n 

|n|<JV |n|<JV 

for any complex numbers b n . We draw two conclusions from this fact. 
First, we can apply the Pythagorean theorem to the decomposition 

f / 'y , Q'n&n T ^ ~ a n e n , 

|n|<JV |ri| <iV 


where we now choose b n — a n , to obtain 


= 11 /- 


|n|<JV 


dn&n 


\n\ <N 


Since the orthonormal property of the family {e n } n ^z implies that 
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we deduce that 


(3) 


ll/ll 2 = 11/ -M/)|| 2 + E Kl 2 


|n|<JV 


The second conclusion we may draw from (2) is the following simple 
lemma. 

Lemma 1.2 (Best approximation) If f isintegrable on the circle with 
Fourier coefficients a n , then 


\\f-s N (f)\\<\\f- e 


|n|<iV 


for any complex numbers c n . Moreover, equality holds precisely when 
c n — o n for all \n\ < N. 

Proof. This follows immediately by applying the Pythagorean theo- 
rem to 



|n|<IV 


\n\<N 


where b n — a n — c n . 

This lemma has a clear geometric interpretation. It says that the 
trigonometric polynomial of degree at most N which is closest to / in 
the norm || ■ || is the partial sum 5jv(/). This geometric property of the 
partial sums is depicted in Figure 1, where the orthogonal projection of 
/ in the plane spanned by {e_jv, . . . , eg, . . . , ejv} is simply S^{f). 



f - S N (f) 



Figure 1 . The best approximation lemma 


We can now give the proof that ||S)v(/) — /|| — > 0 using the best ap- 
proximation lemma, as well as the important fact that trigonometric 
polynomials are dense in the space of continuous functions on the circle. 
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Suppose that / is continuous on the circle. Then, given e > 0, there 
exists (by Corollary 5.4 in Chapter 2) a trigonometric polynomial P, say 
of degree M, such that 

|/(0) -P(0)| < e for all 9. 

In particular, taking squares and integrating this inequality yields 
||/ — P|| < e, and by the best approximation lemma we conclude that 

11/ — (/) || < e whenever N > M. 

This proves Theorem 1.1 when / is continuous. 

If / is merely integrable, we can no longer approximate / uniformly 
by trigonometric polynomials. Instead, we apply the approximation 
Lemma 3.2 in Chapter 2 and choose a continuous function g on the 
circle which satisfies 


sup \g(9)\ < sup \f(9)\ = B, 

Be [0,277] Be [0,27r] 

and 

/»27T 

/ \f(e)-g(6)\de<e 2 . 

Jo 

Then we get 

= ^fj\m-g{e)\\m-g{e)\ d e 

fj \f(0)-g(e)\d8 
< Ce 2 . 

Now we may approximate g by a trigonometric polynomial P so that 
|| g — P|| < e. Then ||/ — P|| < C'e, and we may again conclude by ap- 
plying the best approximation lemma. This completes the proof that the 
partial sums of the Fourier series of / converge to / in the mean square 
norm || ■ ||. 

Note that this result and the relation (3) imply that if a n is the n th 
Fourier coefficient of an integrable function /, then the series ]P^L-oo l a «l' 
converges, and in fact we have Parseval’s identity 

OO 

E i«»r = ii/f- 


n =— oo 
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This identity provides an important connection between the norms in 
the two vector spaces £ 2 (Z) and 7 Z. 

We now summarize the results of this section. 


Theorem 1.3 Let f be an integrable function on the circle with 
f ~ X^-oo ««e me . Then we have: 

(i) Mean-square convergence of the Fourier series 

— / | f(9) - S N (f)(d) | 2 dd^ 0 asN^oo. 

J o 


(ii) ParsevaVs identity 


E 



n=—oo 


l r 2n 

2?r J 0 


IfWfdO. 


Remark 1. If {e n } is any orthonormal family of functions on the 
circle, and a n = (/, e n ), then we may deduce from the relation (3) that 

OO 

E Kl 2 < ll/ll 2 . 

n=— oo 

This is known as Bessel’s inequality. Equality holds (as in Parseval’s 
identity) precisely when the family {e n } is also a “basis,” in the sense 
that II X|n|<jv a n e n - /|| -> 0 as N — > oo. 

Remark 2. We may associate to every integrable function the se- 
quence {a n } formed by its Fourier coefficients. Parseval’s identity guar- 
antees that {a n } € i 2 ifL). Since £ 2 (Z) is a Hilbert space, the failure of 1Z 
to be complete, discussed earlier, may be understood as follows: there 
exist sequences {a n } ne z such that Xngz l a n| 2 < 00 ; Y e f 110 Riemann in- 
tegrable function F has n th Fourier coefficient equal to a n for all n. An 
example is given in Exercise 6. 

Since the terms of a converging series tend to 0, we deduce from Par- 
seval’s identity or Bessel’s inequality the following result. 

Theorem 1.4 (Riemann-Lebesgue lemma) If f is integrable on the 
circle, then f{n) — > 0 as \n\ — > 00 . 

An equivalent reformulation of this proposition is that if / is integrable 
on [0, 2tt], then 
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and 


2? r 

f(0) cos(N6) dO — > 0 as 77 — > oo. 

To conclude this section, we give a more general version of the Parseval 
identity which we will use in the next chapter. 

Lemma 1.5 Suppose F and G are integrable on the circle with 

a ne in6 and G ~ b n e inB . 


Then 


l-j'J F{6)G(6)d6 


OO 

'y ^ a n b n . 

n =— oo 


Recall from the discussion in Example 1 that the series J2^L-oo a nb n 
converges absolutely. 

Proof. The proof follows from Parseval’s identity and the fact that 

(F, G) = \ [||F + G\\ 2 - ||F - G\\ 2 + i (||F + iG\\ 2 - ||F - *G|| 2 )] 

which holds in every Hermitian inner product space. The verification of 
this fact is left to the reader. 


2 Return to pointwise convergence 

The mean-square convergence theorem does not provide further insight 
into the problem of pointwise convergence. Indeed, Theorem 1.1 by itself 
does not guarantee that the Fourier series converges for any 9. Exercise 3 
helps to explain this statement. However, if a function is differentiable 
at a point 8q, then its Fourier series converges at 0q. After proving this 
result, we give an example of a continuous function with diverging Fourier 
series at one point. These phenomena are indicative of the intricate 
nature of the problem of pointwise convergence in the theory of Fourier 
series. 

2.1 A local result 

Theorem 2.1 Let f be an integrable function on the circle which is dif- 
ferentiable at a point 9 0 . Then S , w(/)(^o) ~ > /(# o) as AT tends to infinity. 
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Proof. Define 


m 


m - l) - m) if f ^ 0 and |t| < 

-no o) it t = o. 


First, F is bounded near 0 since / is differentiable there. Second, for 
all small 5 the function F is integrable on [— n, —5] U [5, 7r] because / has 
this property and \t\ > 5 there. As a consequence of Proposition 1.4 in 
the appendix, the function F is integrable on all of [— 7r,7r]. We know 
that S/v(/)(#o) = (f * D n )(6 0 ), where D N is the Dirichlet kernel. Since 
jA f D n — 1, we find that 

S N (f)(0 o ) - f(0o) =* ^ £ f(Oo - t)D N (t) dt - f(6 0 ) 

=» ^ f\m -t)~ m)]D N (t) dt 
- ~ J F(t)tD N (t) dt. 

We recall that 

tD,v(t) = *HF2) sill((Ar + 1/2)t) ’ 


where the quotient sin A/ 2 ) is continuous in the interval [ — 7r, vt] . Since we 
can write 


sin((AT + 1/2 )t) = sin (Nt) cos(t/2) + cos (Aft) sin(t/2), 

we can apply the Riemann-Lebesgue lemma to the Riemann integrable 
functions F (t)t cos(t / 2) / sin(t/2) and F(t)t to finish the proof of the the- 
orem. 

Observe that the conclusion of the theorem still holds if we only assume 
that / satisfies a Lipschitz condition at 8 0 ; that is, 

\f(8)~ f(6 0 )\<M\6-d 0 \ 

for some M > 0 and all 8. This is the same as saying that / satisfies a 
Holder condition of order a — 1. 

A striking consequence of this theorem is the localization principle of 
Riemann. This result states that the convergence of Sn(/)(0 o) depends 
only on the behavior of / near 8q. This is not clear at first, since forming 
the Fourier series requires integrating / over the whole circle. 
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Theorem 2.2 Suppose f and g are two integrable functions defined on 
the circle, and for some do there exists an open interval I containing do 
such that 


f(0) = g {Q) for all del. 


Then SW(/)(^o) — 5V(g)(#o) — ► 0 as N tends to infinity. 

Proof. The function / — g is 0 in I , so it is differentiable at 0q, and 
we may apply the previous theorem to conclude the proof. 

2.2 A continuous function with diverging Fourier series 

We now turn our attention to an example of a continuous periodic func- 
tion whose Fourier series diverges at a point. Thus, Theorem 2.1 fails 
if the differentiability assumption is replaced by the weaker assumption 
of continuity. Our counter-example shows that this hypothesis which 
had appeared plausible, is in fact false; moreover, its construction also 
illuminates an important principle of the theory. 

The principle that is involved here will be referred to as “symmetry- 
breaking.” 1 The symmetry that we have in mind is the symmetry be- 
tween the frequencies e m 0 and e _mS which appear in the Fourier expan- 
sion of a function. For example, the partial sum operator Sn is defined 
in a way that reflects this symmetry. Also, the Dirichlet, Fejer, and 
Poisson kernels are symmetric in this sense. When we break the symme- 
try, that is, when we split the Fourier series EnL-oo a n^ me into the two 
pieces Y.n^o^e 1710 and En<o a ne inB , we introduce new and far-reaching 
phenomena. 

We give a simple example. Start with the sawtooth function / which is 
odd in d and which equals i( n — d) when 0 < d < n. Then, by Exercise 8 
in Chapter 2, we know that 



Consider now the result of breaking the symmetry and the resulting series 


Tl — 1 inf) 



7l= — OO 


Then, unlike (4), the above is no longer the Fourier series of a Riemann 
integrable function. Indeed, suppose it were the Fourier series of an 


1 We have borrowed this terminology from physics, where it is used in a very different 
context. 
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integrable function, say /, where in particular / is bounded. Using the 
Abel means, we then have 


OO yi 

IA(/)(0)l = 

n = 1 

which tends to infinity as r tends to 1, because y 1/n diverges. This 
gives the desired contradiction since 

I A-(/)(0)| < ^ f \hd)\P r (8)d6<sup \f(9 )\ , 

2tt J_ n e 

where P r {9) denotes the Poisson kernel discussed in the previous chapter. 

The sawtooth function is the object from which we will fashion our 
counter-example. We proceed as follows. For each N > 1 we define the 
following two functions on [ — 7r, 7 t] , 

pinQ _ pinO 

fN(o) = — and M 0 ) = — • 

l<|n<A' — .\'<n<— 1 

We contend that: 

(i) |/v(0)| > clog N. 

(ii) Jn{8) is uniformly bounded in N and 9. 

The first statement is a consequence of the fact that J2n = i V 71 ^ 
log IV, which is easily established (see also Figure 2): 


N 1 N ~ 1 /' n + 1 d-r 

yl>y / — 

• ^ n * I '-r 


n = 1 


n = 1 



log N. 


To prove (ii), we argue in the same spirit as in the proof of Tauber’s 
theorem, which says that if the series y c n is Abel summable to s and 
c n — o(l/n), then c n actually converges to s (see Exercise 14 in Chap- 
ter 2). In fact, the proof of Tauber’s theorem is quite similar to that of 
the lemma below. 


Lemma 2.3 Suppose that the Abel means A r = r n c n of the series 

Y^=\ c n ace bounded as r tends to 1 (with r < 1). If c n = 0(l/n), then 
the partial sums Sjy = Y(n=i c n are bounded. 
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Figure 2. Comparing a sum with an integral 


Proof. Let r = 1 — 1/N and choose M so that n|c n | < M. We esti- 
mate the difference 

N OO 

S N -A r = ^(Cn-r n Cn)- ^2 r7lc n 
n= 1 n=N-\-l 


as follows: 


\Sn ~ A r 


N 


OO 


< ^ |c n |(l - r") + r?l | °n 

n=l n=N+l 

N M oo 

<M^(l-r) + - £ r n 

n = 1 n=N-\-l 

M 1 

<MJV(l-r) + - — 

= 2 M, 


where we have used the simple observation that 

1 — r n — (1 — r)(l + r + • • • + r"^ 1 ) < n(l — r). 

So we see that if M satisfies both \A r \ < M and n|c n | < M, then \Sn\ < 
3 M. 

We apply the lemma to the series 

ginO 
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which is the Fourier series of the sawtooth function / used above. Here 
c n — e me /n + e~ me /{— n) for n^O, so clearly c n = 0(l/|n|). Finally, 
the Abel means of this series are A r (f)(9) = (/ * P r ){9). But / is bounded 
and P r is a good kernel, so SV(/)($) is uniformly bounded in N and 9 , 
as was to be shown. 

We now come to the heart of the matter. Notice that /jv and /jy 
are trigonometric polynomials of degree N (that is, they have non-zero 
Fourier coefficients only when \n\ < N). From these, we form trigono- 
metric polynomials Pn and Pn, now of degrees 3 N and 2 N — 1, by 
displacing the frequencies of /jy and Jn by 2 N units. In other words, 
we define Pn{9) = e' l( ' 2N ' ,e /n(9) and Pn(9) = e l ^ 2N ' ,e Jn(9). So while /jy 
has non- vanishing Fourier coefficients when 0 < |n| < N, now the coef- 
ficients of P N are non-vanishing for N < n < 3 N, n ^ 2N . Moreover, 
while n = 0 is the center of symmetry of /jv, now n = 2N is the center 
of symmetry of Pn- We next consider the partial sums Sm- 

Lemma 2.4 

f Pn if M > 3N , 

Sm(Pn) = \ Pn if M — 2N, 

[o if M < N. 

This is clear from what has been said above and from Figure 3. 

In (9) 


-NON 


e i (2N)e f N {6) = Pn(9) 


0 N 2N 3N 


S2N(e i(2N)e f N )(9) 
= e i(2N)e f N {6) 


0 N 2N 3N 


Figure 3. Breaking symmetry in Lemma 2.4 


The effect is that when M = 2 N, the operator Sm breaks the symme- 
try of Pn, but in the other cases covered in the lemma, the action of Sm 
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is relatively benign, since then the outcome is either Pat or 0. 

Finally, we need to find a convergent series of positive terms a k and 
a sequence of integers {N k } which increases rapidly enough so that: 

(i) -Wfc+i > 3N k , 

(ii) a k logJVfc — > oo as k — > oo. 

We choose (for example) a k — 1/fc 2 and N k — 3 2k which are easily seen 
to satisfy the above criteria. 

Finally, we can write down our desired function. It is 

OO 

/(0) = 5> fc P* fc (0). 

k= 1 

Due to the uniform boundedness of the Py (recall that IPwWI - |/iv(6»)|), 
the series above converges uniformly to a continuous periodic function. 
However, by our lemma we get 

|5 , 2iv m (/)(0)| > ca m log N m + 0(1) -> oo as m -> oo. 


-( ) — 

Nk- i 3iVfc_i 


-e- 

N k 


A 


^ — 

3 N k 


2 N k 


f : — 

N k + 1 


— *— 

3%i 


Figure 4. Symmetry broken in the middle interval (N k ,3N k ) 


Indeed, the terms that correspond to N k with k < m or k > m con- 
tribute 0(1) or 0, respectively (because the Pat’s are uniformly bounded), 
while the term that corresponds to N m is in absolute value greater than 
ca m log N m because \Pn(0)\ — |/jv( 0)| > clogA^. So the partial sums of 
the Fourier series of / at 0 are not bounded, and we are done since this 
proves the divergence of the Fourier series of / at 9 — 0. To produce a 
function whose series diverges at any other preassigned 6 — do, it suffices 
to consider the function f(6 — 8 0 ). 

3 Exercises 

1. Show that the first two examples of inner product spaces, namely and C d , 
are complete. 
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[Hint: Every Cauchy sequence in R has a limit.] 

2. Prove that the vector space £ 2 (Z) is complete. 

[Hint: Suppose A k = { ak in }nei. with fc = 1,2,... is a Cauchy sequence. Show 
that for each n, {a,k,n}kLi is a Cauchy sequence of complex numbers, therefore 
it converges to a limit, say b n . By taking partial sums of || — A w || and letting 

k' — * oo , show that \\Ak — B\\ — * 0 as k — > oo, where B = , b_i, b 0 , b±, . . .). 

Finally, prove that B £ £ 2 (Z).] 

3. Construct a sequence of integrable functions {/&} on [0, 27 t] such that 



but limfc^oo fk{9) fails to exist for any 6. 

[Hint: Choose a sequence of intervals Ik C [0, 2-7t] whose lengths tend to 0, and 
so that each point belongs to infinitely many of them; then let fk = Xi k ] 

4. Recall the vector space 1Z of integrable functions, with its inner product and 
norm 



(a) Show that there exist non-zero integrable functions / for which ||/|| = 0. 

(b) However, show that if / 6 77 with ||/|| = 0, then f(x) = 0 whenever / is 
continuous at x. 

(c) Conversely, show that if / £ 77 vanishes at all of its points of continuity, 


then ||/|| = 0. 


5. Let 


m = 


0 for 6 = 0 

log(l/0) for 0 < 6 < 2n, 


and define a sequence of functions in 77 by 


fn(0) 


0 for 0 < 6 < 1/n 
f(6) for 1/n < 9 < 2n. 


Prove that {f n }// =1 is a Cauchy sequence in 77. However, / does not belong to 
77. 
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[Hint: Show that J/ b (log 6) 2 dd — > 0 ifO < a < b and b — > 0, by using the fact that 
the derivative of #(log#) 2 — 26 log 9 + 26 is equal to (log 6>) 2 .] 

6. Consider the sequence {ak}^L_ oc defined by 

_ j 1/fc if fc > 1 
“ fc “ \ 0 if k < 0. 

Note that {dj,} G ^ 2 (Z), but that no Riemann integrable function has k th Fourier 
coefficient equal to a,k for all k. 

7. Show that the trigonometric series 

sin nx 

converges for every x, yet it is not the Fourier series of a Riemann integrable 
function. 

The same is true for ^ S1 °" x for 0 < a < 1, but the case 1/2 < a < 1 is more 
difficult. See Problem 1. 


y — 

t lop- 


i>2 


logn 


8. Exercise 6 in Chapter 2 dealt with the sums 


E — and 
n 2 

n odd >1 



n=l 


Similar sums can be derived using the methods of this chapter. 

(a) Let / be the function defined on [— 7r,7r] by f(9) = |0|. Use Parseval’s 
identity to find the sums of the following two series: 


OO 


E 

71=0 


1 

(2n+l) 4 


and 



71=1 


In fact, they are 7 t 4 /96 and 7r 4 /90, respectively. 

(b) Consider the 27r-periodic odd function defined on [0, w] by f(6) = 9(n — 6). 
Show that 


E 


71 = 0 


1 

(2n+ l) 6 


960 


and 



71=1 


Remark. The general expression when k is even for y//T 1 1 /n k in terms of n k 
is given in Problem 4. However, finding a formula for the sum X^T=i l/ n3 > or 
more generally l/n k with k odd, is a famous unresolved question. 
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9. Show that for a not an integer, the Fourier series of 

n c i(ir-x)a 

sin 7ra 


on [0, 27 t] is given by 


E 


ginx 

n + a 


Apply Parseval’s formula to show that 


E 


i 

(n + a) 2 


7T 


2 


(sin7ra) 2 


10. Consider the example of a vibrating string which we analyzed in Chapter 1. 
The displacement u{x, t ) of the string at time t satisfies the wave equation 

1 d 2 u d 2 u „ 

^dt 2= dx 2 ’ c ~ = T / p - 

The string is subject to the initial conditions 

du 

u(x,0) = f{x) and —(x,0) = g(x), 

where we assume that / £ C 1 and g is continuous. We define the total energy 
of the string by 


m 



The first term corresponds to the “kinetic energy” of the string (in analogy with 
(l/2)mn 2 , the kinetic energy of a particle of mass m and velocity v), and the 
second term corresponds to its “potential energy.” 

Show that the total energy of the string is conserved, in the sense that E(t) 
is constant. Therefore, 


m = e{ o) 


pj 9 {x) 2 dx+^rJ f'(x) 2 dx. 


11 . The inequalities of Wirtinger and Poincare establish a relationship between 
the norm of a function and that of its derivative. 
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(a) If / is T-periodic, continuous, and piecewise C 1 with fj' f(t ) dt = 0, show 
that 

pT rr i2 rT 

l \m\ 2 dt<^J o \f{t)\*dt , 

with equality if and only if f(t) = Asin(2nt/T) + Bcos(2nt/T). 
[Hint: Apply Parseval’s identity.] 

(b) If / is as above and g is just C 1 and T-periodic, prove that 


f{t)g{t) dt 


< 


T 2 

An 2 


\f(t)\ 2 dt / \g'(t)\ 2 dt. 


(c) For any compact interval [a, 6] and any continuously differentiable function 
/ with f(a) = f(b) = 0, show that 

J a ^ J a 

Discuss the case of equality, and prove that the constant (6 — a) 2 / n 2 can- 
not be improved. [Hint: Extend / to be odd with respect to a and periodic 
of period T = 2(b — a) so that its integral over an interval of length T is 
0. Apply part a) to get the inequality, and conclude that equality holds if 
and only if /(I) = Asin(7r|5f )]. 


POO 

/ sm x 7 r 

12. Prove that / ax — . 

Jo x 2 

[Hint: Start with the fact that the integral of D^(6) equals 27T, and note that 
the difference (1/ sin(0/2)) — 2/6 is continuous on [ — tt, 7t]. Apply the Riemann- 
Lebesgue lemma.] 

13. Suppose that / is periodic and of class C k . Show that 

/(n) = o(l/|n| fe ), 

that is, |n| fc /(n) goes to 0 as |n| — * oo. This is an improvement over Exercise 10 
in Chapter 2. 

[Hint: Use the Riemann-Lebesgue lemma.] 

14. Prove that the Fourier series of a continuously differentiable function / on 
the circle is absolutely convergent. 

[Hint: Use the Cauchy-Schwarz inequality and Parseval’s identity for /'.] 

15. Let / be 27r-periodic and Riemann integrable on [ — 7r, 7r] . 
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(a) Show that 


1 

2t r 


/_ 


7 r 


f(n) 


f(x + n/n)e mx dx 


— inx 


—tv 


hence 



/ 


'7T 


[f(x)-f(x + 7r/n)\e-^dx. 


—mx 


— TV 


(b) Now assume that / satisfies a Holder condition of order a, namely 


f(x + h) - f(x ) | < C\h 


for some 0 < a < 1, some C > 0, and all x, h. Use part a) to show that 


f(n) = 0(l/|n|“). 


(c) Prove that the above result cannot be improved by showing that the func- 
tion 



where 0 < a < 1, satisfies 


\f(x + h)~ f(x ) | < C\h 


and f(N) = 1/N a whenever N = 2 fc . 

[Hint: For (c), break up the sum as follows f(x + h) — f(x) = ^ 2 fc <i/|ft.| + 
5 Z^ 2 fe>i/|ft.| • estimate the first sum use the fact that |1 — e* e | < \0\ whenever 9 
is small. To estimate the second sum, use the obvious inequality |e l:r — e ty \ < 2.] 

16. Let / be a 27r-periodic function which satisfies a Lipschitz condition with 
constant K; that is, 


f(x) - f(y ) | < K\x - y\ for all x, y. 


This is simply the Holder condition with a = 1, so by the previous exercise, we 
see that f(n) = 0(l/|n|). Since the harmonic series ^ 1/n diverges, we cannot 
say anything (yet) about the absolute convergence of the Fourier series of /. The 
outline below actually proves that the Fourier series of / converges absolutely 
and uniformly. 
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(a) For every positive h we define gh{x) = f(x + h) — f(x — h). Prove that 


1 

2n 


\g h {x)\ 2 dx 


OO 

4| sinn/i| 2 |/(n)| 2 , 

71— — OO 


and show that 


OO 

E I sinnh| 2 |/(n)| 2 < A' 2 h 2 . 

n =— oo 


(b) Let p be a positive integer. By choosing h = 7t/ 2 p+1 , show that 


E i/(")i 2 < 

2P- 1 <|ra|<2P 


A' 2 7T 2 
2 2 P+! ’ 


(c) Estimate l/( n )lj anc ^ conclude that the Fourier series of / 

converges absolutely, hence uniformly. [Hint: Use the Cauchy-Schwarz 
inequality to estimate the sum.] 

(d) In fact, modify the argument slightly to prove Bernstein’s theorem: If / 
satisfies a Holder condition of order a > 1/2, then the Fourier series of / 
converges absolutely. 


17. If / is a bounded monotonic function on [ — 7r, 7 t] , then 

/(n) = 0(l/|n|). 

[Hint: One may assume that / is increasing, and say |/| < M. First check that 
the Fourier coefficients of the characteristic function of [a, b] satisfy 0(l/|n|). 
Now show that a sum of the form 

N 

El a kX[a k ,a k+ 1 ](x) 
k= 1 


with — 7r = a\ < d 2 < • • • < ajv < ujv+i = and — M < cki < • • ■ < ai v < M has 
Fourier coefficients that are 0(l/|n|) uniformly in N . Summing by parts one gets 
a telescopic sum — «*) which can be bounded by 2 M. Now approximate 

/ by functions of the above type.] 

18. Here are a few things we have learned about the decay of Fourier coefficients: 

(a) if / is of class C k , then f(n) = o(l/|n| fc ); 

(b) if / is Lipschitz, then f(n) = 0(l/|n|); 
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(c) if / is monotonic, then /(n) = 0(l/|n|); 

(d) if / is satisfies a Holder condition with exponent a where 0 < a < 1, then 
/(n) = 0(l/|n|“); 

(e) if / is merely Riemann integrable, then X^l/( n )| 2 < 00 and therefore 
fin) = o(l). 

Nevertheless, show that the Fourier coefficients of a continuous function can 
tend to 0 arbitrarily slowly by proving that for every sequence of nonnegative 
real numbers {e„} converging to 0, there exists a continuous function / such 
that |/(n)| > e n for infinitely many values of n. 

[Hint: Choose a subsequence {e nk } so that e nk < oo.] 


19 . Give another proof that the sum X^o<|ra|<jv e lnx / n is uniformly bounded in 
N and x G [ — 7r, 7r] by using the fact that 


1 x -\ e mx sinna; 

2 i n n 

0<|n|<iV n = 1 


{D N (t) - 1 )dt. 


where Djy is the Dirichlet kernel. Now use the fact that J 0 °° dt < oo which 
was proved in Exercise 12. 


20 . Let f(x) denote the sawtooth function defined by f(x) = ( 7 r — x)/2 on the 
interval (0, 2n) with /( 0) = 0 and extended by periodicity to all of R. The 
Fourier series of / is 


1 0 inx 00 

'W~5 EV-E 


2 i n * — J n 

7^0 n= 1 


and / has a jump discontinuity at the origin with 


/(° + ) = ^> /(° ) = - 7T and hence /(0+) - /(0 ) 


Show that 


7T f sin t 7T 

max Sjv(/)(i) ~o= —r dt ~ o> 

0<x<7r/iV 2 J q t Z 


which is roughly 9% of the jump 7r. This result is a manifestation of Gibbs’s 
phenomenon which states that near a jump discontinuity, the Fourier series of a 
function overshoots (or undershoots) it by approximately 9% of the jump. 

[Hint: Use the expression for Sjy(f) given in Exercise 19. J 
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1. For each 0 < a < 1 the series 
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E 


sin nx 


converges for every x but is not the Fourier series of a Riemann integrable func- 
tion. 

(a) If the conjugate Dirichlet kernel is defined by 

) 1 if n > 0 
0 if n = 0 
-1 if n < 0, 


then show that 


and 


D n (x) = 


cos(a;/2) — cos ((IV + l/2)x) 
sin(a;/2) 

\D]y(x)\ dx < clog TV. 


(b) As a result, if / is Riemann integrable, then 

(f*D N )(0) = OQogN). 


(c) In the present case, this leads to 


E^ = °( 1 °g iV )> 


which is a contradiction. 


2. An important fact we have proved is that the family {e lnx } n ez is orthonormal 
in 1Z and it is also complete, in the sense that the Fourier series of / converges 
to / in the norm. In this exercise, we consider another family possessing these 
same properties. 

On [—1,1] define 

d n 

L n{x) = ~^(. x 2 - V) n , n = 0, 1,2, — 

Then L n is a polynomial of degree n which is called the ?i th Legendre poly- 
nomial. 
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(a) Show that if / is indefinitely differentiable on [—1,1], then 



x 


,2 


1 ) n f {n \x) dx. 


In particular, show that L n is orthogonal to x m whenever m < n. Hence 
{L n }™ = 0 is an orthogonal family. 

(b) Show that 



[Hint: First, note that ||L n || 2 = (— l)”(2n)! f^ 1 (x 2 — l) n dx. Write 
(. x 2 — 1)" = (x — l) n (x + l) n and integrate by parts n times to calculate 
this last integral.] 

(c) Prove that any polynomial of degree n that is orthogonal to 1, x, a; 2 , ... , x ra_1 
is a constant multiple of L n . 

(d) Let C n = L n /||L„||, which are the normalized Legendre polynomials. Prove 
that {Cn\ is the family obtained by applying the “Gram-Schmidt process” 
to {1, x, . . . , x", . . .}, and conclude that every Riemann integrable function 
/ on [—1, 1] has a Legendre expansion 


OO 




71 = 0 


which converges to / in the mean-square sense. 

3. Let a be a complex number not equal to an integer. 

(a) Calculate the Fourier series of the 27r-periodic function defined on [— 7r, 7t] 
by f(x) = cos(aa;). 

(b) Prove the following formulas due to Euler: 



n 2 — a 2 2a 2 2atan(a7r) 


11 7T 


7T 


n= 1 



n= 1 


For all u G C — 7rZ, 
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(c) Show that for all a 6 C — Z we have 

n 2 — a 2 

7 r 

(cmt) 

[Hint: Split the integral as J + f^° and change variables t = 1/tt in the 
second integral. Now both integrals are of the form 

f 1 O ' -1 , 

X T Tt dt - 0<7<1 ’ 

which one can show is equal to Y^kLo ^+7 ■ Use part (c) to conclude the 
proof.] 


sin(a7r) 


= 1 + 2« 2 E 


(d) For all 0 < a < 1, show that 

-j-CX — 1 


/0 


t + 1 


dt = 


4. In this problem, we find the formula for the sum of the series 

00 

V- 

Z— / n k 

71=1 

where k is any even integer. These sums are expressed in terms of the Bernoulli 
numbers; the related Bernoulli polynomials are discussed in the next problem. 

Define the Bernoulli numbers B n by the formula 


t-> 

z = B n _ n 
e z — 1 n\ 

71 = 0 


(a) Show that B 0 = 1, B\ = —1/2, B 2 = 1/6, B 3 = 0, B 4 = —1/30, and 
B 5 = 0. 

(b) Show that for n > 1 we have 


B 


n 



Bk- 


(c) By writing 



71=2 
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show that B n = 0 if n is odd and > 1. Also prove that 

_°°_ 02 Tl TD 

zcotz = l + Y [ (-ir- ( ^r^. 

(d) The zeta function is defined by 

oo 

C(s) = X!^7’ for all s > 1. 

n = 1 

Deduce from the result in (c), and the expression for the cotangent func- 
tion obtained in the previous problem, that 

zcot^f^y 

/jY 2777 * 

771=1 


(e) Conclude that 


2C(2m) = (-1) 


ra+l (27T) 2 - 
(2m)! 


Bo 


5. Define the Bernoulli polynomials B n (x ) by the formula 

ze xz = B n (x) _ n 

e z — 1 n! 

n=0 

(a) The functions B n (x ) are polynomials in x and 



Show that B 0 (x) = 1, Bi(x) = x — 1/2, B 2 (x) = x 2 — x + 1/6, and 
B 3 (x) = x 3 — \x 2 + \x. 

(b) If n > 1, then 

B n (x + 1) - B n (x) = n* ra_1 , 

and if n > 2, then 

B n { 0) = B n ( 1) = B n . 

(c) Define S m (n) = l m + 2 m + ■ ■ ■ + (n — l) m . Show that 

(m -(- l)S' 7 7 i(u) = B m .^\. 
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(d) Prove that the Bernoulli polynomials are the only polynomials that satisfy 

(i) B 0 ( x) = 1, 

(ii) B' n (x ) = for n > 1, 

(iii) fg B n (x) dx = 0 for n > 1, and show that from (b) one obtains 

X + l 

B n (t) dt = x n . 

(e) Calculate the Fourier series of Bi(x) to conclude that for 0 < x < 1 we 
have 


Bi(x) = x — 1/2 


— 1 sin(27rfex) 

7T k 

k= 1 


Integrate and conclude that 


B 2 n{x) = (- 1 ) 




( 2») 2 


fc 2 " 


o (r \-( i \n+i 2 ( 2 ^ + 1)! y' sin(27rfca:) 
B2n+l(x) ( 1) (2n) 2n+1 £2n+l 

k= 1 


Finally, show that for 0 < x < 1, 


B n {x) 


n\ 

(27 Tl) n 


E 


^2-nikx 


We observe that the Bernoulli polynomials are, up to normalization, successive 
integrals of the sawtooth function. 
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Some Applications of Fourier 
Series 


Fourier series and analogous expansions intervene very 
naturally in the general theory of curves and surfaces. 
In effect, this theory, conceived from the point of view 
of analysis, deals obviously with the study of arbitrary 
functions. I was thus led to use Fourier series in sev- 
eral questions of geometry, and I have obtained in this 
direction a number of results which will be presented 
in this work. One notes that my considerations form 
only a beginning of a principal series of researches, 
which would without doubt give many new results. 

A. Hurwitz, 1902 


In the previous chapters we introduced some basic facts about Fourier 
analysis, motivated by problems that arose in physics. The motion of a 
string and the diffusion of heat were two instances that led naturally to 
the expansion of a function in terms of a Fourier series. We propose next 
to give the reader a flavor of the broader impact of Fourier analysis, and 
illustrate how these ideas reach out to other areas of mathematics. In 
particular, consider the following three problems: 

I. Among all simple closed curves of length t in the plane R 2 , which 
one encloses the largest area? 

II. Given an irrational number 7 , what can be said about the distri- 
bution of the fractional parts of the sequence of numbers nj, for 
n = 1,2,3,...? 

III. Does there exist a continuous function that is nowhere differen- 
tiable? 

The first problem is clearly geometric in nature, and at first sight, would 
seem to have little to do with Fourier series. The second question lies on 
the border between number theory and the study of dynamical systems, 
and gives us the simplest example of the idea of “ergodicity.” The third 
problem, while analytic in nature, resisted many attempts before the 
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solution was finally discovered. It is remarkable that all three questions 
can be resolved quite simply and directly by the use of Fourier series. 

In the last section of this chapter, we return to a problem that provided 
our initial motivation. We consider the time-dependent heat equation 
on the circle. Here our investigation will lead us to the important but 
enigmatic heat kernel for the circle. However, the mysteries surrounding 
its basic properties will not be fully understood until we can apply the 
Poisson summation formula, which we will do in the next chapter. 

1 The isoperimetric inequality 

Let r denote a closed curve in the plane which does not intersect itself. 
Also, let £ denote the length of T, and A the area of the bounded region 
in K 2 enclosed by F. The problem now is to determine for a given £ the 
curve F which maximizes A (if any such curve exists). 



Figure 1 . The isoperimetric problem 


A little experimentation and reflection suggests that the solution should 
be a circle. This conclusion can be reached by the following heuristic con- 
siderations. The curve can be thought of as a closed piece of string lying 
flat on a table. If the region enclosed by the string is not convex (for ex- 
ample) , one can deform part of the string and increase the area enclosed 
by it. Also, playing with some simple examples, one can convince oneself 
that the “flatter” the curve is in some portion, the less efficient it is in 
enclosing area. Therefore we want to maximize the “roundness” of the 
curve at each point. 

Although the circle is the correct guess, making the above ideas precise 
is a difficult matter. 

The key idea in the solution we give to the isoperimetric problem con- 
sists of an application of Parseval’s identity for Fourier series. However, 
before we can attempt a solution to this problem, we must define the 
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notion of a simple closed curve, its length, and what we mean by the 
area of the region enclosed by it. 

Curves, length and area 

A parametrized curve 7 is a mapping 

7 : [a, b] — > R 2 . 

The image of 7 is a set of points in the plane which we call a curve and 
denote by T. The curve F is simple if it does not intersect itself, and 
closed if its two end-points coincide. In terms of the parametrization 
above, these two conditions translate into 7(51) 7^ 7(82) unless si = a 
and S2 = b, in which case 7(a) = 7 (6). We may extend 7 to a periodic 
function on R of period b — a, and think of 7 as a function on the circle. 
We also always impose some smoothness on our curves by assuming that 
7 is of class C 1 , and that its derivative 7' satisfies 7 '(s) 7^ 0 . Altogether, 
these conditions guarantee that F has a well-defined tangent at each 
point, which varies continuously as the point on the curve varies. More- 
over, the parametrization 7 induces an orientation on F as the parameter 
s travels from a to b. 

Any C 1 bijective mapping s : [c, d] — > [a, b] gives rise to another 
parametrization of F by the formula 

V(t) = 7 (*(t))- 

Clearly, the conditions that T be closed and simple are independent of 
the chosen parametrization. Also, we say that the two parametrizations 
7 and 77 are equivalent if s' (t) > 0 for all t; this means that 77 and 7 
induce the same orientation on the curve F. If, however, s'(t) < 0 , then 
77 reverses the orientation. 

If F is parametrized by 7 (s) = (x(s), y(s)), then the length of the 
curve F is defined by 

£= f | 7 '( s )|ds=/ (x'(s) 2 +y'(s) 2 ) 1/2 ds. 

J a J a 

The length of F is a notion intrinsic to the curve, and does not depend 
on its parametrization. To see that this is indeed the case, suppose that 
7 (s(t)) = Tj(t). Then, the change of variables formula and the chain rule 
imply that 

[ \Y(s)\ds= f |Y(s(t))| |s'(t)|df= f \ri'{t)\dt, 

J a J c J c 
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as desired. 

In the proof of the theorem below, we shall use a special type of 
parametrization for F. We say that 7 is a parametrization by arc- 
length if |'y / (s) | = 1 for all s. This means that y(s) travels at a constant 
speed, and as a consequence, the length of F is precisely b — a. Therefore, 
after a possible additional translation, a parametrization by arc-length 
will be defined on [0,f]. Any curve admits a parametrization by arc- 
length (Exercise 1). 


We now turn to the isoperimetric problem. 

The attempt to give a precise formulation of the area A of the region 
enclosed by a simple closed curve T raises a number of tricky questions. 
In a variety of simple situations, it is evident that the area is given by 
the following familiar formula of the calculus: 


( 1 ) 


A = 


(xdy — y dx) 


x(s)y'(s) - y(s)x\s)ds 


see, for example, Exercise 3. Thus in formulating our result we shall 
adopt the easy expedient of taking (1) as our definition of area. This 
device allows us to give a quick and neat proof of the isoperimetric in- 
equality. A listing of issues this simplification leaves unresolved can be 
found after the proof of the theorem. 


Statement and proof of the isoperimetric inequality 

Theorem 1.1 Suppose that F is a simple closed curve in R 2 of length 
i , and let A denote the area of the region enclosed by this curve. Then 

A P 

A ~ 4t r’ 

with equality if and only ifT is a circle. 


The first observation is that we can rescale the problem. This means 
that we can change the units of measurement by a factor of 5 > 0 as 
follows. Consider the mapping of the plane R 2 to itself, which sends the 
point (x, y) to ( bx,by )• A look at the formula defining the length of a 
curve shows that if F is of length l , then its image under this mapping 
has length bl. So this operation magnifies or contracts lengths by a 
factor of b depending on whether b > 1 or b < 1. Similarly, we see that 
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the mapping magnifies (or contracts) areas by a factor of 5 2 . By taking 
5 = we see that it suffices to prove that if £ = 2n then A < n, with 
equality only if F is a circle. 

Let 7 : [0, 27t] — > IR 2 with j(s) = (x(s),y(s)) be a parametrization by 
arc-length of the curve F, that is, x'(s) 2 + y'(s) 2 = 1 for all s £ [0,27r]. 
This implies that 

1 ( 2 ’ k 

(2) —J o (x'(s) 2 + y\s) 2 )ds = l. 

Since the curve is closed, the functions x(s) and y(s) are 27r-periodic, so 
we may consider their Fourier series 

x (s ) ~ a ^ mS and V ( s ) ~ Y h nG ms - 

Then, as we remarked in the later part of Section 2 of Chapter 2, we 
have 


x'(s) ~ y ~' j a n ine tns and y'(s) ~ Y, b n ine ins . 

Parseval’s identity applied to (2) gives 


OO 

(3) Y |n | 2 (|a n | 2 + | 6 n | 2 ) = 1. 

n =— oo 


We now apply the bilinear form of Parseval’s identity (Lemma 1.5, Chap- 
ter 3) to the integral defining A. Since x(s) and y(s) are real-valued, we 
have a n = aZF and b n — b- n , so we find that 


^=5 


x(s)y'(s) - y(s)x'(s)ds 


OO 

E »( 


b n CL ri 


We observe next that 

(4) l^n^n ^n^n| — 2 |a n | \b n \ ^ \^n\ H - |^n| 5 

and since \n\ < |n| 2 , we may use (3) to get 

oo 

A < 7 T Y \n\ 2 (\a n \ 2 + \bn\ 2 ) 

n =— oo 

< 7T, 
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as desired. 

When A — 7r, we see from the above argument that 

x(s) = a_ie _ls + a 0 + aue* 5 and y(s) = 6_ \e~ ls + b 0 + b 1 e ls 

because \n\ < \n\ 2 as soon as |n| > 2. We know that x(s) and y(s) are 
real-valued, so a_i — aT and b-\ = b\. The identity (3) implies that 
2 (|ai| 2 + | i>i | 2 ) = 1, and since we have equality in (4) we must have 
|ai| = |6i| = 1/2. We write 

ai = - e ia and b\ — - e l/3 . 

The fact that 1 — 2\a\b\ — a\b\\ implies that | sin(a — 0)\ — 1, hence 
a — (3 = kir/2 where k is an odd integer. From this we find that 

x(s) = ao + cos(a + s) and y(s) = bo ± sin(a + s), 

where the sign in y(s) depends on the parity of (k — l)/2. In any case, 
we see that F is a circle, for which the case of equality obviously holds, 
and the proof of the theorem is complete. 

The solution given above (due to Hurwitz in 1901) is indeed very ele- 
gant, but clearly leaves some important issues unanswered. We list these 
as follows. Suppose F is a simple closed curve. 

(i) How is the “region enclosed by F” defined? 

(ii) What is the geometric definition of the “area” of this region? Does 
this definition accord with (1)? 

(iii) Can these results be extended to the most general class of sim- 
ple closed curves relevant to the problem — those curves which are 
“rectifiable” — that is, those to which we can ascribe a finite length? 

It turns out that the clarifications of the problems raised are connected 
to a number of other significant ideas in analysis. We shall return to 
these questions in succeeding books of this series. 

2 Weyl’s equidistribution theorem 

We now apply ideas coming from Fourier series to a problem dealing 
with properties of irrational numbers. We begin with a brief discussion 
of congruences, a concept needed to understand our main theorem. 
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The reals modulo the integers 

If x is a real number, we let [x] denote the greatest integer less than 
or equal to x and call the quantity [x] the integer part of x. The 
fractional part of x is then defined by (x) = x — [x] . In particular, 
(x) € [0,1) for every x € M. For example, the integer and fractional parts 
of 2.7 are 2 and 0.7, respectively, while the integer and fractional parts 
of —3.4 are —4 and 0.6, respectively. 

We may define a relation on IR by saying that the two numbers x and 
y are equivalent, or congruent, if x — y £ Z. We then write 

x = y mod Z or x = y mod 1 . 

This means that we identify two real numbers if they differ by an integer. 
Observe that any real number x is congruent to a unique number in 
[0, 1) which is precisely (x), the fractional part of x. In effect, reducing 
a real number modulo Z means looking only at its fractional part and 
disregarding its integer part. 

Now start with a real number 7^0 and look at the sequence 
7 , 2y, 37 , .... An intriguing question is to ask what happens to this 
sequence if we reduce it modulo Z, that is, if we look at the sequence of 
fractional parts 

(7), (27>, (37), •••• 

Here are some simple observations: 

(i) If 7 is rational, then only finitely many numbers appearing in (777) 
are distinct. 

(ii) If 7 is irrational, then the numbers ( 717 ) are all distinct. 

Indeed, for part (i), note that if 7 = p/q, the first q terms in the sequence 
are 


(p/q), (2 p/q), •••, ((q-l)p/q), (qp/q) = 0 . 

The sequence then begins to repeat itself, since 

((q + l)p/q) = (1 +p/q) = (p/q), 

and so on. However, see Exercise 6 for a more refined result. 

Also, for part (ii) assume that not all numbers are distinct. We there- 
fore have (7117) = (7127) for some 71 1 ^ 712; then 7717 — ?i 2 7 € Z, hence 7 
is rational, a contradiction. 
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In fact, it can be shown that if 7 is irrational, then (717) is dense in the 
interval [0,1), a result originally proved by Kronecker. In other words, 
the sequence (nj) hits every sub-interval of [0, 1) (and hence it does so 
infinitely many times) . We will obtain this fact as a corollary of a deeper 
theorem dealing with the uniform distribution of the sequence (717) . 

A sequence of numbers £1, £2, ■ • • , £n, • ■ ■ in [0, 1) is said to be equidis- 
tributed if for every interval (a, b) C [0, 1), 


lim 

TV— >-oo 


#{ 1 < n < N : £ n € (a, b)} _ & ^ 


where if A denotes the cardinality of the finite set A. This means that 
for large N, the proportion of numbers £ n in (a, b) with n < N is equal to 
the ratio of the length of the interval (a, b) to the length of the interval 
[0,1). In other words, the sequence £ n sweeps out the whole interval 
evenly, and every sub-interval gets its fair share. Clearly, the ordering of 
the sequence is very important, as the next two examples illustrate. 

Example 1. The sequence 


11212 



2 

5’ 


appears to be equidistributed since it passes over the interval [0,1) very 
evenly. Of course this is not a proof, and the reader is invited to give 
one. For a somewhat related example, see Exercise 8 with a — 1/2. 


Example 2. Let {r n }f ^ =1 be any enumeration of the rationals in [0, 1). 
Then the sequence defined by 

, _ f r n j 2 if n is even, 

\ 0 if n is odd, 

is not equidistributed since “half” of the sequence is at 0. Nevertheless, 
this sequence is obviously dense. 

We now arrive at the main theorem of this section. 

Theorem 2.1 7/7 is irrational, then the sequence of fractional parts 
(7), (27), (37), ... is equidistributed in [0, 1). 

In particular, (717) is dense in [0,1), and we get Kronecker’s theo- 
rem as a corollary. In Figure 2 we illustrate the set of points (7), (27), 
(37 ), . . . , (N 7) for three different values of N when 7 = y/2. 
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N = 10 

h • • • • ■ ■ • • H 

0 1 

N = 30 

K I 

0 1 


N = 80 



Figure 2. The sequence { 7 ), ( 27 ), ( 37 ), . . . , (Nj) when 7 = y/2 


Fix (a, b) C [0,1) and let X{a,b)( x ) denote the characteristic function 
of the interval (a, 6 ), that is, the function equal to 1 in (a, 6 ) and 0 in 
[0,1) — (a, 6 ). We may extend this function to R by periodicity (pe- 
riod 1), and still denote this extension by X(a,b){ x )• Then, as a conse- 
quence of the definitions, we find that 


N 

#{1 <n< N : ( 717 ) G (a, b)} = ^ X(a,f>)(™7 ), 

n = 1 


and the theorem can be reformulated as the statement that 


1 

N 


N 

^Zx{a,b){nx) 


71 = 1 


X(a,b){x) dx, 


as N 


00 . 


This step removes the difficulty of working with fractional parts and 
reduces the number theory to analysis. 

The heart of the matter lies in the following result. 

Lemma 2.2 If f is continuous and periodic of period 1, and 7 is irra- 
tional, then 


1 N 

71=1 

The proof of the lemma is divided into three steps. 

Step 1. We first check the validity of the limit in the case when / 
is one of the exponentials 1 , e 2mx , . . . , e 2ntkx : ... . If f — l, the limit 
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surely holds. If / = e 2nlkx with k A 0, then the integral is 0. Since 7 is 
irrational, we have e 2nlk ' y A 1 , therefore 


1 

N 


N 

n = 1 


g2itik'y ^ ^nikN'y 

N 1 — e 2nik-y ’ 


which goes to 0 as ./V — > 00 . 

Step 2 . It is clear that if / and g satisfy the lemma, then so does 
Af + Bg for any A,BeC. Therefore, the first step implies that the 
lemma is true for all trigonometric polynomials. 


Step 3. Let e > 0. If / is any continuous periodic function of period 1, 
choose a trigonometric polynomial P so that sup xeR | f(x) — P(x) \ < e/3 
(this is possible by Corollary 5.4 in Chapter 2). Then, by step 1, for all 
large N we have 


Therefore 
1 N 
n= 1 


N 


— p ( n i) - i p ( x ) dx 


n= 1 


< e/3. 


/■1 1 Aj _ 

/ f(x)dx < — J2 l/( n 7) - ^("7)1+ 
J ° * „=1 


N 


— P( n l) - / P(x) dx 


n = 1 


+ 


I P(x) - f{x) I dx 


< e, 


and the lemma is proved. 


Now we can finish the proof of the theorem. Choose two continuous 
periodic functions /+ and /~ of period 1 which approximate X{a,b)(x) 
on [0,1) from above and below; both /+ and /“ are bounded by 1 and 
agree with X(a,b)i x ) except in intervals of total length 2e (see Figure 3). 
In particular, fp(x) < X(a,b)( x ) < f+( x ), and 

b — a — 2e < f /” (x) dx and / / e + (x) dx < b — a + 2e. 

Jo Jo 

If S N = P Y,n=i X(a,b){nx), then we get 

N N 

( n 7) <S N <—J2 

n=l n=l 


Ibookroot October 20, 2007 


110 Chapter 4. SOME APPLICATIONS OF FOURIER SERIES 


ft 



Therefore 

b — a — 2e < liminf Sjy and limsup Sn < b — a + 2e. 

N ^oo A— >oo 

Since this is true for every e > 0, the limit linijv->oo SW exists and must 
equal b — a. This completes the proof of the equidistribution theorem. 

This theorem has the following consequence. 

Corollary 2.3 The conclusion of Lemma 2.2 holds for every function 
f which is Riemann integrable in [0, 1], and periodic of period 1. 

Proof. Assume / is real-valued, and consider a partition of the 
interval [0, 1], say 0 = xq < x\ < ■ ■ ■ < xn = 1. Next, define fu{x) = 

su Pzj-i<i/<*i f if x € \ x 3-^ x i) and /iW = inf Xj- X <y< xj f(y) for x G 
{Xj- i,Xj). Then clearly f L <f< fu and 

f fL(x)dx< f f (x) dx < f fu(x)dx. 

Jo Jo Jo 

Moreover, by making the partition sufficiently fine we can guarantee that 
for a given e > 0, 



/l(x) dx < e. 


1 

N 


N 




h(x) 


n= 1 


dx 


However, 
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by the theorem, because each Jl is a hnite linear combination of charac- 
teristic functions of intervals; similarly we have 



n= 1 


From these two assertions we can conclude the proof of the corollary by 
using the previous approximation argument. 

There is an interesting interpretation of the lemma and its corollary, 
in terms of a simple dynamical system. In this example, the underlying 
space is the circle parametrized by the angle 9. We also consider a 
mapping of this space to itself: here, we choose a rotation p of the circle 
by the angle 2irj, that is, the transformation p : 9 i— > 9 + 2777 . 

We want next to consider how this space, with its underlying action 
p , evolves in time. In other words, we wish to consider the iterates of p, 
namely p, p 2 , p 3 , . . ., p n where 


p n — popo---op: 0\-^8 + 27m7, 


and where we think of the action p n taking place at the time t — n. 

To each Riemann integrable function / on the circle, we can also asso- 
ciate the corresponding effects of the rotation p, and obtain a sequence 
of functions 


m,n P (8)),f(p 2 (8)),...,f(p n (e)),... 


with f(p n (9)) — f(9 + 2nnj). In this special context, the ergodicity of 
this system is then the statement that the “time average” 



exists for each 9 and equals the “space average” 



whenever 7 is irrational. In fact, this assertion is merely a rephrasing of 
Corollary 2.3, once we make the change of variables 9 — 2 - 7 ra;. 

Returning to the problem of equidistributed sequences, we observe that 
the proof of Theorem 2.1 gives the following characterization. 
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Weyl’s criterion. A sequence of real numbers 0 , £2 • ■ • 

[ 0 , 1 ) is equidistributed if and only if for all integers k 7 ^ 0 one 
has 

1 N 

— T e 27rlfe5n — > 0, as N — > 00 . 

n y 

n = 1 

One direction of this theorem was in effect proved above, and the con- 
verse can be found in Exercise 7. In particular, we find that to understand 
the equidistributive properties of a sequence f n , it suffices to estimate 
the size of the corresponding “exponential sum” f2n=i e 27Tlk ^ n . For ex- 
ample, it can be shown using Weyl’s criterion that the sequence (n 2 7 ) 
is equidistributed whenever 7 is irrational. This and other examples can 
be found in Exercises 8 , and 9; also Problems 2, and 3. 

As a last remark, we mention a nice geometric interpretation of the 
distribution properties of (ny). Suppose that the sides of a square are 
reflecting mirrors and that a ray of light leaves a point inside the square. 
What kind of path will the light trace out? 



Figure 4. Reflection of a ray of light in a square 


To solve this problem, the main idea is to consider the grid of the 
plane formed by successively reflecting the initial square across its sides. 
With an appropriate choice of axis, the path traced by the light in the 
square corresponds to the straight line P + (t, yf) in the plane. As a 
result, the reader may observe that the path will be either closed and 
periodic, or it will be dense in the square. The first of these situations 
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will happen if and only if the slope 7 of the initial direction of the light 
(determined with respect to one of the sides of the square) is rational. 
In the second situation, when 7 is irrational, the density follows from 
Kronecker’s theorem. What stronger conclusion does one get from the 
equidistribution theorem? 


3 A continuous but nowhere differentiable function 

There are many obvious examples of continuous functions that are not 
differentiable at one point, say f(x) — \x\. It is almost as easy to con- 
struct a continuous function that is not differentiable at any given finite 
set of points, or even at appropriate sets containing countably many 
points. A more subtle problem is whether there exists a continuous 
function that is nowhere differentiable. In 1861, Riemann guessed that 
the function defined by 


(5) 


R ( x ) = J2 


sin(n 2 x) 


n= 1 


n * 


was nowhere differentiable. He was led to consider this function because 
of its close connection to the theta function which will be introduced in 
Chapter 5. Riemann never gave a proof, but mentioned this example in 
one of his lectures. This triggered the interest of Weierstrass who, in an 
attempt to find a proof, came across the first example of a continuous but 
nowhere differentiable function. Say 0 < b < 1 and a is an integer > 1 . 
In 1872 he proved that if ab > 1 + 3n/2, then the function 


OO 

W(x) = ^ b n cos(a n x) 

n= 1 


is nowhere differentiable. 

But the story is not complete without a final word about Riemann’s 
original function. In 1916 Hardy showed that R is not differentiable at 
all irrational multiples of 7r, and also at certain rational multiples of n. 
However, it was not until much later, in 1969, that Gerver completely 
settled the problem, first by proving that the function R is actually 
differentiable at all the rational multiples of n of the form irp/q with p 
and q odd integers, and then by showing that R is not differentiable in 
all of the remaining cases. 

In this section, we prove the following theorem. 
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Theorem 3.1 If 0 < a < 1, then the function 

OO 

f a (x) = f(x) = Y^ 2 -™V 2 ”* 

n = 0 

is continuous but nowhere differentiable. 

The continuity is clear because of the absolute convergence of the se- 
ries. The crucial property of / which we need is that it has many van- 
ishing Fourier coefficients. A Fourier series that skips many terms, like 
the one given above, or like W(x), is called a lacunary Fourier series. 

The proof of the theorem is really the story of three methods of sum- 
ming a Fourier series. First, there is the ordinary convergence in terms 
of the partial sums = g * D^- Next, there is Cesaro summabil- 

ity crjv(ff) = g * Fn , with Fjv the Fejer kernel. A third method, clearly 
connected with the second, involves the delayed means defined by 


A N (g) = 2 (T 2N {g) - v N (g)- 

Hence A N (g) = g * [2 F 2 n ~ F N ], These methods can best be visualized 
as in Figure 5. 

Suppose g{x) ~ a n e mx . Then: 

• Sn arises by multiplying the term a n e mx by 1 if \n\ < N, and 0 if 
|n| > N. 

• ct n arises by multiplying a n e mx by 1 — \n\/N for \n\ < N and 0 for 
Ini > N. 


• Ajv arises by multiplying a n e mx by 1 if |n| < N, by 2(1 — |n|/(2AT)) 
for N < \n\ < 2 N, and 0 for |n| > 2 N. 

For example, note that 


VN(g){x) = 


S 0 (g)(x) + S!(g)(x) H h S N -i(g)(x) 


N 


N - 1 


= » a E 




Akx 


N 


£=0 \ k \<£ 

E ( w - 

\n\<N 

= E {'-'4 

|n|<JV x 


n a n e 


a n e 
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i 


-NON 
Partial sums 

Sn(9)(x) = E \n\<N a ne inx 
1 


-NON 
Cesaro means 

<7N(g)(x) = Em<iv i 1 - a ne" 



-2N -NO N 2N 


Delayed means 

A at ( 3) (a;) = 2cr 2 Ar(3)(a;) - a N (g)(x) 
Figure 5. Three summation methods 
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The proof of the other assertion is similar. 

The delayed means have two important features. On the one hand, 
their properties are closely related to the (good) features of the Cesaro 
means. On the other hand, for series that have lacunary properties like 
those of /, the delayed means are essentially equal to the partial sums. 
In particular, note that for our function / — f a 

(6) S N (f) = A N .(f), 

where N' is the largest integer of the form 2 k with N' < N. This is clear 
by examining Figure 5 and the definition of /. 

We turn to the proof of the theorem proper and argue by contradiction; 
that is, we assume that f{x o) exists for some xq. 

Lemma 3.2 Let g be any continuous function that is differentiable at 
xq. Then, the Cesaro means satisfy o’N(g)'{xo) — 0(logiV) ; therefore 

A N (g)'(x Q ) = 0(log N). 


Proof. First we have 

Fn(x o - t)g(t) dt — f F' N (t)g{x 0 - t) dt , 

J — 7T 


°N(g)'{x o) = 


where F^ is the Fejer kernel. Since Fry is periodic, we have ff (t)dt = 0 

and this implies that 

ON{g)'{x 0 ) = f F' N (t)[g{x 0 - t) - g(x 0 )]dt. 

From the assumption that g is differentiable at xq we get 
|ccv(sO'(a;o)| <C I \ F’ N (t)\ \t\ dt. 

J — 7T 

Now observe that F'n satisfies the two estimates 

\F' N {t)\<AN 2 and }F^(t)| < A 

For the first inequality, recall that Fn is a trigonometric polynomial 
of degree N whose coefficients are bounded by 1. Therefore, F' N is a 
trigonometric polynomial of degree N whose coefficients are no bigger 
than N. Hence \F'(t)\ < (2N + 1)N < AN 2 . 
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For the second inequality, we recall that 




1 sin 2 (iVi/2 ) 
N sin 2 (f/2) 


Differentiating this expression, we get two terms: 


sin(iVf/2) cos(Nt/2) 1 cos(f/2) sin 2 (IVf/2) 
sin 2 (t/2) N sin 3 (t/2) 


If we then use the facts that |sin(fVt/2)| < CN\t\ and |sin(t/2)| > c|t| 
(for |f| < 7 r), we get the desired estimates for F'nW- 
Using all of these estimates we find that 


a N (g)'(x 0 )| < C [ K(t)| \t\ dt + C f \F' N (t)\ \t\ dt 

J\t\>l/N J\t\<l/N 

<CA [ ~ + CAN f dt 

= 0(log N) + 0(1) 

= 0(log N). 


The proof of the lemma is complete once we invoke the definition of 


Lemma 3.3 If 2N — 2 n , then 

A 2N (f) ~ A N (f) = 2 _nQ e* 2 " x . 

This follows from our previous observation (6) because A 2 jv(/) = 
S2N(f) and A N {f) = S N (f). 

Now, by the first lemma we have 

A 2 w(/)'(* 0 ) - A iV (/) , (xo) = 0(log N), 

and the second lemma also implies 

|A 2 jv(/) , (xo) - A JV (/)'(x 0 )| = 2 n ^ > cN 1 ~ a . 

This is the desired contradiction since IV 1_ “ grows faster than log N. 

A few additional remarks about our function f a (x) = 2“ nc V 2 " x 

are in order. 
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This function is complex-valued as opposed to the examples R and W 
above, and so the nowhere differentiability of f a does not imply the same 
property for its real and imaginary parts. However, a small modification 
of our proof shows that, in fact, the real part of f a , 

OO 

J2^~ na cos 2 n x, 

n = 0 

as well as its imaginary part, are both nowhere differentiable. To see 
this, observe first that by the same proof, Lemma 3.2 has the following 
generalization: if g is a continuous function which is differentiable at xo, 
then 


A N (g)'(x 0 + h) — O(log N) whenever \h\ < c/N. 

We then proceed with F{x) = o 2~ na cos 2 n x, noting as above that 
A 2 n{F) ~ A n (F) — 2~ na cos 2 n x; as a result, assuming that F is differ- 
entiable at xq, we get that 

| 2 «(!-“) S in(2 n (x 0 + h))\ = 0(log N) 

when 2 N — 2 n , and \h\ < c/N. To get a contradiction, we need only 
choose h so that |sin(2 n (x 0 + h))\ — 1; this is accomplished by setting 
S equal to the distance from 2 n xo to the nearest number of the form 
( k + 1/2)7 r, 1 6 Z (so 8 < 7r/2), and taking h — ±5/2 n . 

Clearly, when a > 1 the function f a is continuously differentiable since 
the series can be differentiated term by term. Finally, the nowhere dif- 
ferentiability we have proved for a < 1 actually extends to a = 1 by a 
suitable refinement of the argument (see Problem 8 in Chapter 5). In 
fact, using these more elaborate methods one can also show that the 
Weierstrass function W is nowhere differentiable if ab > 1. 

4 The heat equation on the circle 

As a final illustration, we return to the original problem of heat diffusion 
considered by Fourier. 

Suppose we are given an initial temperature distribution at t = 0 on a 
ring and that we are asked to describe the temperature at points on the 
ring at times t > 0. 

The ring is modeled by the unit circle. A point on this circle is de- 
scribed by its angle 6 = 2nx, where the variable x lies between 0 and 1. 
If u(x, t) denotes the temperature at time t of a point described by the 
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angle d 1 then considerations similar to the ones given in Chapter 1 show 
that u satisfies the differential equation 

du d 2 u 

(7) 

The constant c is a positive physical constant which depends on the 
material of which the ring is made (see Section 2.1 in Chapter 1). After 
rescaling the time variable, we may assume that c — 1. If / is our initial 
data, we impose the condition 

u(x,0) = f(x). 

To solve the problem, we separate variables and look for special solutions 
of the form 


u(x, t) — A(x)B(t). 

Then inserting this expression for u into the heat equation we get 

B'(t) _ A"(x) 

B(t) A(x) 

Both sides are therefore constant, say equal to A. Since A must be 
periodic of period 1, we see that the only possibility is A = — 4-7T 2 n 2 , 
where n € Z. Then A is a linear combination of the exponentials e 2nmx 
and e -27rmx , and B(t) is a multiple of e~ 4n n t . By superposing these 
solutions, we are led to 

OO 

(8) u(x,t)= a n e~^ 2nH e 2ninx , 

n =— oo 

where, setting t = 0, we see that {a n } are the Fourier coefficients of /. 

Note that when / is Riemann integrable, the coefficients a n are 
bounded, and since the factor e _4?r n 4 tends to zero extremely fast, the 
series defining u converges. In fact, in this case, u is twice differentiable 
and solves equation (7). 

The natural question with regard to the boundary condition is the 
following: do we have u(x,t) — > f(x) as t tends to 0, and in what sense? 
A simple application of the Parseval identity shows that this limit holds 
in the mean square sense (Exercise 11). For a better understanding of 
the properties of our solution (8), we write it as 


u(x,t) = (f *H t )(x) 
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where H t is the heat kernel for the circle, given by 


OO 



n =— oo 


and where the convolution for functions with period 1 is defined by 



An analogy between the heat kernel and the Poisson kernel (of Chapter 2) 


is given in Exercise 12. However, unlike in the case of the Poisson kernel, 
there is no elementary formula for the heat kernel. Nevertheless, it turns 
out that it is a good kernel (in the sense of Chapter 2). The proof is 
not obvious and requires the use of the celebrated Poisson summation 
formula, which will be taken up in Chapter 5. As a corollary, we will 
also find that H t is everywhere positive, a fact that is also not obvious 
from its defining expression (9). We can, however, give the following 
heuristic argument for the positivity of H t . Suppose that we begin with 
an initial temperature distribution / which is everywhere < 0. Then it 
is physically reasonable to expect u(x, t) < 0 for all t since heat travels 
from hot to cold. Now 



If H t is negative for some xq, then we may choose / < 0 supported near 
xo, and this would imply u(xo,t) > 0, which is a contradiction. 

5 Exercises 

1. Let 7 : [a, b] — » R 2 be a parametrization for the closed curve T. 

(a) Prove that 7 is a parametrization by arc-length if and only if the length 
of the curve from 7 (a) to 7(s) is precisely s — a, that is, 



y(t)\ dt = s — a. 


(b) Prove that any curve T admits a parametrization by arc-length. [Hint: If 


77 is any parametrization, let h(s) = |77'(t)| dt and consider 7 = 77 o h h] 


2. Suppose 7: [a,b\ — > R 2 is a parametrization for a closed curve T, with 
7(1) = h(t),y(t)). 
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(a) Show that 


1 

2 


(x(s)y'(s) - y(s)x'{s))ds 


x(s)y'(s) ds 


y(s)x'(s) ds. 


(b) Dehne the reverse parametrization of 7 by 7 “ : [a, 6 ] — » R 2 with 
7 ~(t) =7 (b + a — t). The image of 7 “ is precisely T, except that the 
points 7 ~ (t) and 7 (t) travel in opposite directions. Thus 7 “ “reverses” 
the orientation of the curve. Prove that 


(xdy — y dx) 



ydx). 


In particular, we may assume (after a possible change in orientation) that 


A = 


(x(s)y'(s) - y(s)x'(s))ds = / x(s)y’(s)ds. 


3. Suppose T is a curve in the plane, and that there exists a set of coordinates 
x and y so that the a^-axis divides the curve into the union of the graph of 
two continuous functions y = f(x) and y = g( x) for 0 < x < 1 , and with f(x) > 
g(x) (see Figure 6 ). Let fl denote the region between the graphs of these two 
functions: 


^ = {(z, y) ■ 0 < X < 1 and g(x) <y< f{x)}. 



With the familiar interpretation that the integral f h(x ) dx gives the area 
under the graph of the function h, we see that the area of Q is f(x)dx — 
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fg g(x) dx. Show that this definition coincides with the area formula A given in 
the text, that is, 


/ f(x) dx- g{x) dx = 

Jo Jo 

Also, note that if the orientation of the curve is chosen so that 0 “lies to the 
left” of r, then the above formula holds without the absolute value signs. 

This formula generalizes to any set that can be written as a finite union of 
domains like f 1 above. 



4. Observe that with the definition of £ and A given in the text, the isoperimetric 
inequality continues to hold (with the same proof) even when T is not simple. 

Show that this stronger version of the isoperimetric inequality is equivalent 
to Wirtinger’s inequality, which says that if / is 27r-periodic, of class C 1 , and 
satisfies j ^ f(t) dt = 0, then 


|/(f)| 2 dt < / \f'{t)\ 2 dt 


with equality if and only if f(t) = Asint + B cos t (Exercise 11, Chapter 3). 

[Hint: In one direction, note that if the length of the curve is 2-7T and 7 is an 
appropriate arc-length parametrization, then 


2 (t r-A)= [x'(s)+y(s)] 2 ds+ (y' (s) 2 - y(s) 2 ) ds. 

Jo Jo 

A change of coordinates will guarantee J 2lr y(s) ds = 0. For the other direction, 
start with a real- valued / satisfying all the hypotheses of Wirtinger’s inequality, 
and construct g, 27r-periodic and so that the term in brackets above vanishes.] 


5. Prove that the sequence {'y n )^ =l , where 7„ is the fractional part of 

(¥)" 

is not equidistributed in [0,1]. 

[Hint: Show that U n = ^ 1+ 2 V ^ ^ ^ 1 ~ 2 V ^ ^ is the solution of the difference 

equation U r+ i = U r + U r - 1 with Uq = 2 and U\ = 1. The U n satisfy the same 
difference equation as the Fibonacci numbers.] 

6. Let 9 = p/q be a rational number where p and q are relatively prime inte- 
gers (that is, 9 is in lowest form). We assume without loss of generality that 
q > 0. Define a sequence of numbers in [0, 1) by = (n6) where (•) denotes the 


Ibookroot October 20, 2007 


5. Exercises 123 

fractional part. Show that the sequence {£ 1 , £ 2 , • ■ •} is equidistributed on the 
points of the form 

0, 1 /q, 2 /q, (q- 1 )/q. 

In fact, prove that for any 0 < a < q, one has 

#{n : 1 < n < TV, (n6) = a/q } = 1 Q ( ±_\ 

TV q \N J ' 

[Hint: For each integer k > 0, there exists a unique integer n with kq < n < {k + 
1 )q and so that (n0) = a/q. Why can one assume k = 0? Prove the existence 
of n by using the fact 1 that if p and q are relatively prime, there exist integers 
x, y such that xp + yq = 1. Next, divide TV by q with remainder, that is, write 
TV = iq + r where 0 < i and 0 < r < q. Establish the inequalities 

£ < jt={n : 1 < n < N, ( n6 ) = a/q} < l + 1.] 


7. Prove the second part of Weyl’s criterion: if a sequence of numbers £ 1 , £ 2 , - - ■ 
in [0, 1) is equidistributed, then for all k G Z — {0} 

1 N 

-Ve 2 "* 5 "-. 0 as TV ^ 00 . 

TV ^ 

n = 1 

[Hint: It suffices to show that -h Yln=i fi£n) fi x ) dx for all continuous /. 

Prove this hrst when / is the characteristic function of an interval.] 


8. Show that for any a/0, and a with 0 < a < 1, the sequence ( an a ) is equidis- 
tributed in [0, 1). 

[Hint: Prove that J2n = 1 e 27ribrl<T = 0(TV CT ) + 0{N 1 ~ a ) if 6 / 0.] In fact, note the 
following 


N 

^ ^ ^2'Kibn a 
n=l 



9. In contrast with the result in Exercise 8, prove that (alogn) is not equidis- 
tributed for any a. 

[Hint: Compare the sum e 27rli,lo s ra with the corresponding integral.] 

10 . Suppose that / is a periodic function on R of period 1, and {£„} is a sequence 
which is equidistributed in [0, 1). Prove that: 


lr The elementary results in arithmetic used in this exercise can be found at the begin- 
ning of Chapter 8. 
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fl 


(a) If / is continuous and satisfies f 0 f(x) dx = 0, then 


lim E' /(* + £„) = 0 uniformly in x. 
I — /V • 


N 


N — >oo N 


[Hint: Establish this result first for trigonometric polynomials.] 

(b) If / is merely integrable on [0, 1] and satisfies f* f(x) dx = 0, then 


lim 

N — ►oo 


N 




dx = 0. 


11 . Show that if u(x,t) = (/ * H t ){x) where H t is the heat kernel, and / is 
Riemann integrable, then 

u(x, t ) — f(x) | 2 dx — > 0 as t — > 0. 



12 . A change of variables in (8) leads to the solution 

u( 9 , r) = J 2 a n e- n 2 T e™ e = (/ * h T )( 6 ) 


of the equation 

du d 2 u 

7— - = 77-77 with 0 < 6 < 27 t and r > 0, 
or o 9 z 

with boundary condition u{ 9 , 0) = f( 9 ) ~ ^2 a n e zne ■ Here h T { 9 ) = 
En=-oo e ~ n ' TelnS - This version of the heat kernel on [0, 27 t] is the analogue 
of the Poisson kernel, which can be written as P r ( 9 ) = E^L-00 e~^ T e lrie with 
r = e~ T (and so 0 < r < 1 corresponds to r > 0). 

13 . The fact that the kernel H t (x) is a good kernel, hence u(x,t) — > f(x) at 
each point of continuity of /, is not easy to prove. This will be shown in the 
next chapter. However, one can prove directly that H t (x) is “peaked” at x = 0 
as t — * 0 in the following sense: 

(a) Show that f^ 1 ~j . 2 \Ht(x )\ 2 dx is of the order of magnitude of t -1 / 2 as t — > 0. 

More precisely, prove that t 1 / 2 “ 2 \H t (x )\ 2 dx converges to a non-zero 

limit as t — > 0. 

(b) Prove that x 2 \H t (x )\ 2 dx = 0 {t 1 / 2 ) as t — > 0. 
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[Hint: For (a) compare the sum 'Y^ ao e ~ cn2t with the integral e - ™ 2 * dx 
where c > 0. For (b) use x 2 < C/sin 7ra:) 2 for — l/2<x< 1/2, and apply the 
mean value theorem to e~ cx *.l 


6 Problems 

1. * This problem explores another relationship between the geometry of a curve 
and Fourier series. The diameter of a closed curve T parametrized by 
7 (t) = (x(t),y(t)) on [ — 7r, 7r] is defined by 

d= sup \P—Q\ = sup |7(ti) — 7(^2)!- 

p, Qer 

If a n is the n th Fourier coefficient of 7 (t) = x(t) + iyit) and l denotes the length 
of T, then 

(a) 2|a n | < d for all n / 0. 

(b) i < 7T d, whenever T is convex. 

Property (a) follows from the fact that 2a n = ^ f™ [j(t) — 7 (t + n /n)]e~' lnt dt. 

The equality i = nd is satished when T is a circle, but surprisingly, this is 
not the only case. In fact, one finds that i = nd is equivalent to 2|ai| = d. We 
re-parametrize 7 so that for each t in [— n, 7r] the tangent to the curve makes an 
angle t with the y-axis. Then, if a± = 1 we have 

i{t) = *e !t (l +r(t)), 

where r is a real-valued function which satisfies r(t) + r(t + n) = 0, and 
|r(t)| < 1. Figure 7 (a) shows the curve obtained by setting r(t ) = cos 5 1. Also, 
Figure 7 (b) consists of the curve where r(t) = /i(3t), with h(s) = — 1 if —tv < 
s < 0 and h(s) = 1 if 0 < s < n. This curve (which is only piecewise of class C 1 ) 
is known as the Reuleaux triangle and is the classical example of a convex curve 
of constant width which is not a circle. 

2. * Here we present an estimate of Weyl which leads to some interesting results. 

(a) Let Sn = X^=i e 27ri 7("). Show that for H < N, one has 

H N-h 

l-S/vl 2 < C— e 27ri(f{n+h) - f{n)) , 

h=0 n = 1 

for some constant c > 0 independent of N, H, and /. 

(b) Use this estimate to show that the sequence (n 2 7) is equidistributed in 
[0, 1) whenever 7 is irrational. 
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(a) 


(b) 


Figure 7. Some curves with maximal length for a given diameter 


(c) More generally, show that if {£„} is a sequence of real numbers so that 
for all positive integers h the difference (£ n +h — £ n ) is equidistributed in 
[0, 1), then (£„) is also equidistributed in [0, 1). 

(d) Suppose that P(x) = c n x n + ■ ■ ■ + Co is a polynomial with real coefficients, 
where at least one of ci, . . . , c n is irrational. Then the sequence ( P(n )) is 
equidistributed in [0, 1). 

[Hint: For (a), let a n = e 27 ™7(") w hen 1 < n < N and 0 otherwise. Then write 
H ^2 n a n = a, n +k and apply the Cauchy-Schwarz inequality. For (b), 

note that (n + h) 2 7 — n 2 y = 2nhy + h 2 7, and use the fact that for each integer 
h , the sequence (2n/i7) is equidistributed. Finally, to prove (d), assume first that 
P(x) = Q{x) + cix + Co where C\ is irrational, and estimate the exponential sum 
y _ 1 e 27rlfcp ( Il ) > Then, argue by induction on the highest degree term which has 
an irrational coefficient, and use part (c).] 

3. * If a > 0 is not an integer and a / 0, then (an a ) is equidistributed in [0, 1). 
See also Exercise 8. 

4. An elementary construction of a continuous but nowhere differentiable func- 
tion is obtained by “piling up singularities,” as follows. 

On [—1, 1] consider the function 


ip(x) = \x 


and extend tp to R by requiring it to be periodic of period 2. Clearly, tp is 
continuous on R and |<p(a ; )| < 1 for all x so the function / defined by 



is continuous on R. 
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(a) Fix Xq 6 M. For every positive integer m, let 5 m = ±|4 _m where the 
sign is chosen so that no integer lies in between A m Xo and 4’™(;eo + 5 m ). 
Consider the quotient 

_ ^"(zq + 5 m )) ~ <fi(4 n Xo) 


Prove that if n > m, then -y n = 0, and for 0 < n < m one has \^ n \ < 4™ 
with \j m \ = 4 m . 

(b) From the above observations prove the estimate 


f(x o + 5m) ~ f{x o) 

5 m 


>^(3 m + l), 


and conclude that / is not differentiable at Xq. 


5. Let / be a Riemann integrable function on the interval [ — 7r, 7 t] . We define 
the generalized delayed means of the Fourier series of / by 

_ Sn + ■ ■ ' + <SjV+7f-l 


Note that in particular 

oo,iv = o-jv, cr jv,i = Sn and <tn,n = Ajv, 

where Ajv are the specihc delayed means used in Section 3. 

(a) Show that 

&n,k = (( N + K)cjn+k ~ Non) , 

A 

and 

&n,k = S N + E /VK^- 

N+1<\v\<N+K-l ' ' 

From this last expression for <tn,k conclude that 


&n,k ~ S M \ < 


E i/» 

jv+i<H<iv+if-i 


for all N <M <N + K. 
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(b) Use one of the above formulas and Fejer’s theorem to show that with 
N = kn and K = n, then 


Vkn,n(f)( d ) -► /(0) as n -> 00 


whenever / is continuous at 6 , and also 


<7kn,n{f){0) 


f(0 + ) + f(0~) 
2 


as n — > oo 


at a jump discontinuity (refer to the preceding chapters and their exer- 
cises for the appropriate definitions and results). In the case when / is 
continuous on [ — 7r, 7 r], show that Ufc n>n (/) — » / uniformly as n — > oo. 

(c) Using part (a), show that if f(y) = 0{l/\v\) and kn < m < (k + 1 )n, we 
get 

(j 

Wkn n — S m I < — for some constant C > 0. 
fc 


(d) Suppose that f(v) = 0(l/|i/|). Prove that if / is continuous at 6 then 
S N {f){9)^m asiV^oo, 


and if / has a jump discontinuity at 6 then 


s N (fm 


f(e+) + f(d~) 
2 


as N — > oo. 


Also, show that if / is continuous on [ — 7r, 7r] , then Siy(f) — > / uniformly. 

(e) The above arguments show that if ^ c n is Cesaro sunnnable to s and c n = 
0(l/n), then ^ c n converges to s. This is a weak version of Littlewood’s 
theorem (Problem 3, Chapter 2). 


6. Dirichlet’s theorem states that the Fourier series of a real continuous peri- 
odic function / which has only a finite number of relative maxima and minima 
converges everywhere to / (and uniformly). 

Prove this theorem by showing that such a function satisfies f(n) = 0(l/|n|). 

[Hint: Argue as in Exercise 17, Chapter 3; then use conclusion (d) in Problem 5 
above.] 
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The Fourier Transform on M 


The theory of Fourier series and integrals has always 
had major difficulties and necessitated a large math- 
ematical apparatus in dealing with questions of con- 
vergence. It engendered the development of methods 
of summation, although these did not lead to a com- 
pletely satisfactory solution of the problem.. . . For the 
Fourier transform, the introduction of distributions 
(hence the space 5) is inevitable either in an explicit 
or hidden form.. . . As a result one may obtain all that 
is desired from the point of view of the continuity and 
inversion of the Fourier transform. 

L. Schwartz, 1950 


The theory of Fourier series applies to functions on the circle, or equiv- 
alently, periodic functions on R. In this chapter, we develop an analogous 
theory for functions on the entire real line which are non-periodic. The 
functions we consider will be suitably “small” at infinity. There are sev- 
eral ways of defining an appropriate notion of “smallness,” but it will 
nevertheless be vital to assume some sort of vanishing at infinity. 

On the one hand, recall that the Fourier series of a periodic function 
associates a sequence of numbers, namely the Fourier coefficients, to 
that function; on the other hand, given a suitable function / on R, the 
analogous object associated to / will in fact be another function / on R 
which is called the Fourier transform of /. Since the Fourier transform 
of a function on R is again a function on R, one can observe a symmetry 
between a function and its Fourier transform, whose analogue is not as 
apparent in the setting of Fourier series. 


Roughly speaking, the Fourier transform is a continuous version of the 
Fourier coefficients. Recall that the Fourier coefficients a n of a function 
/ defined on the circle are given by 


(1) 


= / f{x)e 


— 2'Kinx 


dx, 
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and then in the appropriate sense we have 

OO 

(2) /(*)= E *ne 2 ” inx . 

n=— oo 

Here we have replaced 9 by 2nx, as we have frequently done previously. 

Now, consider the following analogy where we replace all of the discrete 
symbols (such as integers and sums) by their continuous counterparts 
(such as real numbers and integrals). In other words, given a function / 
on all of R, we define its Fourier transform by changing the domain of 
integration from the circle to all of R, and by replacing n € Z by £ £ R 
in (1), that is, by setting 

/ OO 

f(x)e- 2 ^dx. 

-OO 

We push our analogy further, and consider the following continuous ver- 
sion of (2): replacing the sum by an integral, and a n by /(C), leads to 
the Fourier inversion formula, 

/ OO 

-oo 

Under a suitable hypotheses on /, the identity (4) actually holds, and 
much of the theory in this chapter aims at proving and exploiting this 
relation. The validity of the Fourier inversion formula is also suggested 
by the following simple observation. Suppose / is supported in a finite 
interval contained in / = [-L/2, L/ 2] , and we expand f in a Fourier series 
on I. Then, letting L tend to infinity, we are led to (4) (see Exercise 1). 

The special properties of the Fourier transform make it an important 
tool in the study of partial differential equations. For instance, we shall 
see how the Fourier inversion formula allows us to analyze some equations 
that are modeled on the real line. In particular, following the ideas 
developed on the circle, we solve the time-dependent heat equation for 
an infinite rod and the steady-state heat equation in the upper half- plane. 

In the last part of the chapter we discuss further topics related to the 
Poisson summation formula, 

E /( n ) = E /( n )’ 

n£Z n£Z 


which gives another remarkable connection between periodic functions 
(and their Fourier series) and non-periodic functions on the line (and 
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their Fourier transforms). This identity allows us to prove an assertion 
made in the previous chapter, namely, that the heat kernel Ht(x) satisfies 
the properties of a good kernel. In addition, the Poisson summation 
formula arises in many other settings, in particular in parts of number 
theory, as we shall see in Book II. 

We make a final comment about the approach we have chosen. In our 
study of Fourier series, we found it useful to consider Riemann integrable 
functions on the circle. In particular, this generality assured us that even 
functions that had certain discontinuities could be treated by the theory. 
In contrast, our exposition of the elementary properties of the Fourier 
transform is stated in terms of the Schwartz space S of testing functions. 
These are functions that are indefinitely differentiable and that, together 
with their derivatives, are rapidly decreasing at infinity. The reliance on 
this space of functions is a device that allows us to come quickly to the 
main conclusions, formulated in a direct and transparent fashion. Once 
this is carried out, we point out some easy extensions to a somewhat 
wider setting. The more general theory of Fourier transforms (which 
must necessarily be based on Lebesgue integration) will be treated in 
Book III. 

1 Elementary theory of the Fourier transform 

We begin by extending the notion of integration to functions that are 
defined on the whole real line. 

1.1 Integration of functions on the real line 

Given the notion of the integral of a function on a closed and bounded 
interval, the most natural extension of this definition to continuous func- 
tions over R is 



'OO 


f(x ) dx — lim 
N— >oo 


Of course, this limit may not exist. For example, it is clear that if 


f(x) = 1, or even if f(x) — 1/(1 + |x|), then the above limit is infinite. 


A moment’s reflection suggests that the limit will exist if we impose on 
/ enough decay as |x| tends to infinity. A useful condition is as follows. 

A function / defined on R is said to be of moderate decrease if / 
is continuous and there exists a constant A > 0 so that 



A 


for all x € R. 


,2 
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This inequality says that / is bounded (by A for instance) , and also that 
it decays at infinity at least as fast as 1/x 2 , since A/( 1 + x 2 ) < A/x 2 . 

For example, the function fix) = 1/(1 + |a:| n ) is of moderate decrease 
as long as n> 2. Another example is given by the function e~ a 1*1 for 
a > 0. 

We shall denote by Af(M) the set of functions of moderate decrease 
on R. As an exercise, the reader can check that under the usual addition 
of functions and multiplication by scalars, A1(R) forms a vector space 
over C. 

We next see that whenever / belongs to A1(R), then we may define 

/ oo fN 

f{x) dx = lim / f(x) dx, 

-oo N->ooJ_ N 

where the limit now exists. Indeed, for each N the integral In — 
J N n f(x) dx is well defined because / is continuous. It now suffices to 
show that {In} is a Cauchy sequence, and this follows because if M > N, 
then 


I Im — In I < 


' N<\x\<M 


f{x) dx 


< A 


I N<\x\<M X 


dx 
~ 2 


2 A 

< — — > 0 as N — > oo. 
“ N 


Notice we have also proved that Jjj >JV f(x) dx — > 0 as N — > oo. At this 
point, we remark that we may replace the exponent 2 in the definition 
of moderate decrease by 1 + e where e > 0; that is, 

|/(x)| < , for all x € R. 

u v yl “ 1 + |x| 1+e 

This definition would work just as well for the purpose of the theory 
developed in this chapter. We chose e = 1 merely as a matter of conve- 
nience. 

We summarize some elementary properties of integration over R in a 
proposition. 

Proposition 1.1 The integral of a function of moderate decrease defined 
by ( 5) satisfies the following properties: 
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(i) Linearity: if f,g € .M(IR) and a, b € C, then 

/ oo roo /*oo 

( af(x) + bg(x))dx = a / f(x)dx+b / g(x) dx. 
-oo J — oo J —oo 

(ii) Translation invariance: for every h £ M we have 

/ oo /»oo 

/( x — h ) dx — / /(x) dx. 

-oo J —oo 

(iii) Scaling under dilations: if 6 > 0, dierc 

/ oo /»oo 

/(dx) dx= f{x) dx. 

-oo J — oo 

(iv) Continuity: if f £ A4(IR), t/ien 

/ OO 

j/(x — h) — f{x ) | dx — ► 0 as h — > 0. 

-OO 
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We say a few words about the proof. Property (i) is immediate. To 
verify property (ii), it suffices to see that 



h) dx 


r N 


l-N 


f(x) dx — > 0 as N — ► oo. 


Since fix — h) dx = f N N h h fix) dx, the above difference is majorized 
by 


r — N 


rN 

/ fi x ) dx 

+ 

/ /(x) dx 

J-N-h 


JN-h 


for large N, which tends to 0 as N tends to infinity. 

The proof of property (iii) is similar once we observe that <5 f _ N /(dx) dx 

S-m f( x ) dx - 

To prove property (iv) it suffices to take \h\ < 1. For a preassigned e > 0, 
we first choose N so large that 


l\x\>N 


I f (x) | dx < e/4 


and 



h)\ dx < e/4. 


Now with N fixed, we use the fact that since / is continuous, it is uni- 
formly continuous in the interval [— N — 1, N + !]• Hence 
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sup i^i < | f(x — h) — f{x) | — > 0 as h tends to 0. So we can take h, so 
small that this supremum is less than e/4 N. Altogether, then, 

/ oo pN 

\f(x - h) - f{x)\dx < / \f(x - h) - f(x)\dx 

-oo J-N 

+ I | f{x-h)\dx 

J\x\>N 

+ [ \f( x ) \ dx 

J\x\>N 

< e/2 + e/4 + e/4 = e, 
and thus conclusion (iv) follows. 

1.2 Definition of the Fourier transform 

If / £ A4(M), we define its Fourier transform for £ £ K. by 

/ OO 

f(x)e~ 2 ™tdx. 

-OO 

Of course, \ e ~ 2ntx £\ = 1, so the integrand is of moderate decrease, and 
the integral makes sense. 

In fact, this last observation implies that / is bounded, and moreover, 
a simple argument shows that / is continuous and tends to 0 as |£| — > oo 
(Exercise 5). However, nothing in the definition above guarantees that 
/ is of moderate decrease, or has a specific decay. In particular, it is not 
clear in this context how to make sense of the integral /(£)e 27rlx ^ d£ 
and the resulting Fourier inversion formula. To remedy this, we introduce 
a more refined space of functions considered by Schwartz which is very 
useful in establishing the initial properties of the Fourier transform. 

The choice of the Schwartz space is motivated by an important prin- 
ciple which ties the decay of / to the continuity and differentiability 
properties of / (and vice versa): the faster /(£) decreases as |£| — > oo, 
the “smoother” / must be. An example that reflects this principle is 
given in Exercise 3. We also note that this relationship between / and / 
is reminiscent of a similar one between the smoothness of a function on 
the circle and the decay of its Fourier coefficients; see the discussion of 
Corollary 2.4 in Chapter 2. 

1.3 The Schwartz space 

The Schwartz space on IR consists of the set of all indefinitely differ- 
entiable functions / so that / and all its derivatives /O, . . 
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are rapidly decreasing, in the sense that 

sup |x| fe |/^(x)| < oo for every k,i> 0. 

We denote this space by S = <S(R), and again, the reader should verify 
that <S(R) is a vector space over C. Moreover, if / £ <S(R), we have 

f'(x) — -j- £ <S(R) and xf{x ) £ <S(R). 

This expresses the important fact that the Schwartz space is closed under 
differentiation and multiplication by polynomials. 

A simple example of a function in <S(R) is the Gaussian defined by 

f(x) = e~ x \ 

which plays a central role in the theory of the Fourier transform, as well 
as other fields (for example, probability theory and physics). The reader 
can check that the derivatives of / are of the form P(x)e~ x where P is 
a polynomial, and this immediately shows that / £ <S(R). In fact, e~ ax 
belongs to <S(M) whenever a > 0. Later, we will normalize the Gaussian 
by choosing a = tt. 



An important class of other examples in <S(R) are the “bump func- 
tions” which vanish outside bounded intervals (Exercise 4). 

As a final remark, note that although e - ^ decreases rapidly at infinity, 
it is not differentiable at 0 and therefore does not belong to <S(R). 
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1.4 The Fourier transform on S 

The Fourier transform of a function / £ <S(R) is defined by 

/ OO 

f(x)e~ 2 ™tdx. 

-OO 

Some simple properties of the Fourier transform are gathered in the fol- 
lowing proposition. We use the notation 


/(*) — m 


to mean that / denotes the Fourier transform of /. 
Proposition 1.2 7//£<S(R) then: 


(i) f{x + h) - 

— * f{0 e27rlh 5 whenever h £ R. 

(ii) f(x)e 27Vlxh — > /(£ + h ) whenever h £ R 

(iii) f{5x) — » 

<5 1 f(S x £) whenever S > 0. 

(iv) f'(x) — > 

2t 


d '■ 

(v) —2nixf(x 

' — > w/ £ • 
df 


In particular, except for factors of 27 tz, the Fourier transform inter- 
changes differentiation and multiplication by x. This is the key property 
that makes the Fourier transform a central object in the theory of differ- 
ential equations. We shall return to this point later. 

Proof. Property (i) is an immediate consequence of the translation 
invariance of the integral, and property (ii) follows from the definition. 
Also, the third property of Proposition 1.1 establishes (iii). 

Integrating by parts gives 



/ 0*0 


D — 2-rrix^ 


N 


-N 


i-N 


+ 2nif / f(x)e 2mx € dx, 


l-N 


so letting N tend to infinity gives (iv). 

Finally, to prove property (v), we must show that f is differentiable 
and find its derivative. Let e > 0 and consider 


m + h)-m 

h 


(-2t rixf)(f)= 


/0)e 


— 27 rix£ 


„—2'Kixh 


- 1 


27 rix 


dx. 
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Since /( x) and xf{ x) are of rapid decrease, there exists an integer N 
so that Jj x | >jv |/(x)| dx < e and /| x | >Ar |ic| \f(x)\ dx < e. Moreover, for 
\x\ < N. there exists ho so that \h\ < ho implies 


r .—2ivixh 


- 1 


27 TlX 



Hence for |h| < ho we have 


m 

- (-27 Tixf)(f) 

j-N 



J-N 

f(x)e~ 2nix t 

e 


< 

< C'e. 


n — 2 , Kixh 


- 1 


2nix 


dx + Ce 


Theorem 1.3 If f e 5(R), then f £ 5(IR). 

The proof is an easy application of the fact that the Fourier transform 
interchanges differentiation and multiplication. In fact, note that if / € 
<S(R), its Fourier transform / is bounded; then also, for each pair of 
non- negative integers t and k, the expression 



is bounded, since by the last proposition, it is the Fourier transform of 

The proof of the inversion formula 

/ OO 

f{f)e 2 ^df for f£S(R), 

-oo 

which we give in the next section, is based on a careful study of the 
2 

function e~ ax ~ , which, as we have already observed, is in <S(R) if a > 0. 
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The Gaussians as good kernels 

We begin by considering the case a = it because of the normalization: 

r°° 2 

(6) / e~ nx dx = 1. 

• 7—00 


To see why (6) is true, we use the multiplicative property of the expo- 
nential to reduce the calculation to a two-dimensional integral. More 
precisely, we can argue as follows: 



/ °o roc 

/ e~^ x+y) dxdy 
-oo J — oo 


' — OO *7 — OO 
r2ir rOC 


70 <7 0 


r dr dd 


2irre 


dr 


— e 


OO 

0 


= 1 , 


where we have evaluated the two-dimensional integral using polar coor- 
dinates. 

The fundamental property of the Gaussian which is of interest to us, 

2 

and which actually follows from (6), is that e _7rar equals its Fourier 
transform! We isolate this important result in a theorem. 

Theorem 1.4 If f(x) = e -7 ™ 2 , then /(£) = /(C)- 

Proof. Define 

/ °° 2 

e-™ e~ 2 ^dx, 

-OO 

and observe that F( 0) = 1, by our previous calculation. By property (v) 
in Proposition 1.2, and the fact that f'(x) = —2nxf(x), we obtain 

/ OO rO O 

f{x){-2TTix)e~ 2ni ^ dx=i f'{x)e~ 27Tix t dx. 

-oo J — oo 

By (iv) of the same proposition, we find that 


F\0 = i(27T*0/(0 = -27 rCF(C). 
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If we define G(£) = F(f)e n ^ 2 , then from what we have seen above, it 
follows that G'(f) = 0, hence G is constant. Since F{ 0) = 1, we conclude 
that G is identically equal to 1, therefore F(f) = e _7rS , as was to be 
shown. 

The scaling properties of the Fourier transform under dilations yield 
the following important transformation law, which follows from (iii) in 
Proposition 1.2 (with 5 replaced by 5 -1 / 2 ). 

Corollary 1.5 If 5 > 0 andK$(x ) — <5 -1 / 2 e -7ra;2 /' 5 ; then Ks(0 — e~ nS ^ 2 . 

We pause to make an important observation. As <5 tends to 0, the 
function K$ peaks at the origin, while its Fourier transform K$ gets 
flatter. So in this particular example, we see that K$ and K$ cannot both 
be localized (that is, concentrated) at the origin. This is an example of a 
general phenomenon called the Heisenberg uncertainty principle, which 
we will discuss at the end of this chapter. 

We have now constructed a family of good kernels on the real line, 
analogous to those on the circle considered in Chapter 2. Indeed, with 

K s (x) = <r 1/2 e-™ 2/5 , 


we have: 


(i) lZo K s( x )dx = 1 . 

(ii) f-oo \K s (x)\dx < M. 

(iii) For every ry > 0, we have f\ x \ >v |lf ( s(a;)| dx — > 0 as <5 — > 0. 

To prove (i), we may change variables and use (6), or note that the 
integral equals K$( 0), which is 1 by Corollary 1.5. Since K$ > 0, it is 
clear that property (ii) is also true. Finally we can again change variables 
to get 





dy 


0 


as 5 tends to 0. We have thus proved the following result. 


Theorem 1.6 The collection {/ia} 5 >o is a family of good kernels 
as 5 — > 0. 


We next apply these good kernels via the operation of convolution, 
which is given as follows. If f,g£ <S(M), their convolution is defined by 

/ OO 

f(x - t)g(t) dt. 

-OO 


( 7 ) 
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For a fixed value of x, the function f(x — t)g(t) is of rapid decrease in t, 
hence the integral converges. 

By the argument in Section 4 of Chapter 2 (with a slight modification) , 
we get the following corollary. 

Corollary 1.7 If f E <S(K) , then 

(/ * I\s)(x) — > f[x) uniformly in x as S — > 0 . 

Proof. First, we claim that / is uniformly continuous on M. Indeed, 
given e > 0 there exists R > 0 so that |/(x)| < e/4 whenever \x\ > R. 
Moreover, / is continuous, hence uniformly continuous on the compact 
interval [— R, R] , and together with the previous observation, we can find 
77 > 0 so that | f(x) — f(y)\ < e whenever \x — y\ < 77 . Now we argue as 
usual. Using the first property of good kernels, we can write 

/ OO 

Ks{t) [f(x ~ t) - f(x)} dt, 

-OO 

and since I\s > 0 , we find 

\(f * Ks)(x) — f(x)\ < ( +[ K s {t)\f(x -t) - f(x)\dt. 

I\t\>v *7 Ul <»7 

The first integral is small by the third property of good kernels, and the 
fact that / is bounded, while the second integral is also small since / 
is uniformly continuous and / K$ — 1. This concludes the proof of the 
corollary. 


1.5 The Fourier inversion 

The next result is an identity sometimes called the multiplication for- 
mula. 

Proposition 1.8 If f,g € <S(R), then 

/ °° r°° 

f( x )g(x) dx — / f(y)g(y)dy. 

-00 J — OO 

To prove the proposition, we need to digress briefly to discuss the inter- 
change of the order of integration for double integrals. Suppose F(x, y) 
is a continuous function in the plane (x,y) £ IR 2 . We will assume the 
following decay condition on F: 


\F{x,y)\ < A/{1 +x 2 ){l + y 2 ). 
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Then, we can state that for each x the function F(x, y ) is of moderate 
decrease in y, and similarly for each fixed y the function F(x, y) is of 
moderate decrease in x. Moreover, the function F\(x) = jy. F(x, y ) dy 
is continuous and of moderate decrease; similarly for the function F 2 fy) = 
f- 00 F(x,y)dx. Finally 

/ OO pOO 

Fi(x) dx — / F 2 (y) dy. 

-oo J — OO 


The proof of these facts may be found in the appendix. 

We now apply this to F(x, y) = f(x)g(y)e~ 2mxy . Then F\{x) — 
f(x)g(x), and F 2 (y) = /(y)y(y) so 


f{x)g(x) dx 



f(y)g(y)dy, 


which is the assertion of the proposition. 

The multiplication formula and the fact that the Gaussian is its own 
Fourier transform lead to a proof of the first major theorem. 

Theorem 1.9 (Fourier inversion) If f G <S(M), then 

/ OO 

m)e 2 ^dt 

-oo 


Proof. We first claim that 

/ OO 

KOdf. 

-oo 

Let Gs(x) — e~ nSx2 so that Gs{£) = K$(£). By the multiplication for- 
mula we get 

/ oo poo 

f(x)Ks(x) dx = / fiOGsiOdf,- 

-oo J — oo 

Since K$ is a good kernel, the first integral goes to /( 0) as <5 tends to 0. 
Since the second integral clearly converges to /(£) d£ as 5 tends to 0, 
our claim is proved. In general, let F(y) = f{y + x) so that 

/ oo poo 

F(€)d$= / my™* d£. 

-oo J —oo 


As the name of Theorem 1.9 suggests, it provides a formula that inverts 
the Fourier transform; in fact we see that the Fourier transform is its own 
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inverse except for the change of x to — x. More precisely, we may define 
two mappings T : S (R) — > S (R) and IF* : S (R) — > S (R) by 

/ oo roo 

f(x)e- 2 ^dx and F'(g)(x) = gffie 2 *** d$. 
-oo J — OO 

Thus J- is the Fourier transform, and Theorem 1.9 guarantees that 
IF* o T = I on <S(R), where I is the identity mapping. Moreover, since 
the definitions of IF and IF* differ only by a sign in the exponential, we 
see that !F(f)(y) = IF* (f)(—y), so we also have IF o IF* = /. As a conse- 
quence, we conclude that T* is the inverse of the Fourier transform on 
<S(R), and we get the following result. 

Corollary 1.10 The Fourier transform is a bijective mapping on the 
Schwartz space. 

1.6 The Plancherel formula 

We need a few further results about convolutions of Schwartz functions. 
The key fact is that the Fourier transform interchanges convolutions with 
pointwise products, a result analogous to the situation for Fourier series. 

Proposition 1.11 If f,g £ S( R) then: 

(i) f*g£ S{ R). 

(ii) f*g = g*f. 

(iii) (/*0)(O = /(Off(O- 

Proof. To prove that f * g is rapidly decreasing, observe first that for 
any I > 0 we have sup^, |x| £ |< 7 (a; — y)\ < Ae( 1 + \y\) e , because g is rapidly 
decreasing (to check this assertion, consider separately the two cases 
|x| < 2\y\ and |a?| > 2\y\). From this, we see that 

/ OO 

l/(y)l(i + \y\) e dy, 

-oo 

so that x e (f * g){x) is a bounded function for every l > 0. These esti- 
mates carry over to the derivatives of f * g, thereby proving that 
f * g £ S (R) because 

(H) </ * = (/ * (f) »K*> for k = 1 , 2, 
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This identity is proved first for k = 1 by differentiating under the inte- 
gral defining f * g. The interchange of differentiation and integration is 
justified in this case by the rapid decrease of dg/dx. The identity then 
follows for every k by iteration. 

For fixed x, the change of variables x — y = u shows that 





This change of variables is a composition of two changes, y i— > — y and 
y i— > y — h (with h = x). For the first one we use the observation that 
fZ F(x) dx = J'Z 0 F(—x) dx for any Schwartz function F, and for the 
second, we apply (ii) of Proposition 1.1 

Finally, consider F(x, y) = f{y)g{x — y)e~ 2mx ^. Since / and g are 
rapidly decreasing, considering separately the two cases |ar| < 2\y\ and 
x > 2\y\, we see that the discussion of the change of order of integration 
after Proposition 1.8 applies to F. In this case F±(x) — (/ * g)(x)e ^ 27Tlx ^ , 


and F 2 {y) = f(y)e 2niyi g(f). Thus fZ F^x) dx = fZ F 2 {y) dy, which 


implies (iii). The proposition is therefore proved. 

We now use the properties of convolutions of Schwartz functions to 
prove the main result of this section. The result we have in mind is the 
analogue for functions on R of Parseval’s identity for Fourier series. 

The Schwartz space can be equipped with a Hermitian inner product 



whose associated norm is 



The second major theorem in the theory states that the Fourier transform 
is a unitary transformation on <S(M). 


Theorem 1.12 (Plancherel) If f E <S(M) then ||/|| = ||/||. 


Proof. If / € <S(M) define f b (x) = Then f b (f) — /{£)■ Now 

let h = f * / b . Clearly, we have 
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The theorem now follows from the inversion formula applied with x = 0, 
that is, 



1.7 Extension to functions of moderate decrease 

In the previous sections, we have limited our assertion of the Fourier 
inversion and Plancherel formulas to the case when the function involved 
belonged to the Schwartz space. It does not really involve further ideas to 
extend these results to functions of moderate decrease, once we make the 
additional assumption that the Fourier transform of the function under 
consideration is also of moderate decrease. Indeed, the key observation, 
which is easy to prove, is that the convolution f * g of two functions / and 
g of moderate decrease is again a function of moderate decrease (Exer- 
cise 7); also f * g — fg. Moreover, the multiplication formula continues 
to hold, and we deduce the Fourier inversion and Plancherel formulas 
when / and / are both of moderate decrease. 

This generalization, although modest in scope, is nevertheless useful 
in some circumstances. 

1.8 The Weierstrass approximation theorem 

We now digress briefly by further exploiting our good kernels to prove 
the Weierstrass approximation theorem. This result was already alluded 
to in Chapter 2. 

Theorem 1.13 Let f be a continuous function on the closed and bounded 
interval [a, b] C M. Then, for any e > 0, there exists a polynomial P such 
that 


sup \f(x) - P(x)\ <e. 


x(E [a,b\ 


In other words, f can be uniformly approximated by polynomials. 

Proof. Let [— M, M ] denote any interval that contains [a, 6] in its 
interior, and let g be a continuous function on M that equals 0 outside 
[— M, M ] and equals / in [a, b\. For example, extend / as follows: from b 
to M define g by a straight line segment going from f(b) to 0, and from 
a to — M by a straight line segment from /(a) also to 0. Let B be a 
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bound for g , that is, \g(x)\ < B for all x. Then, since {A^} is a family of 
good kernels, and g is continuous with compact support, we may argue 
as in the proof of Corollary 1.7 to see that g * K$ converges uniformly 
to g as 5 tends to 0. In fact, we choose So so that 

jg(x) — (g * IQ 0 )(x)l < e/2 for all x E R. 


Now, we recall that e x is given by the power series expansion e x — 
x n /n\ which converges uniformly in every compact interval of R. 
Therefore, there exists an integer N so that 

1^0 (*) - ^ for all X e [— 2M, 2 M] 

where R{x) = Sq 1 ^ 2 J2n = o • Then, recalling that g vanishes 

outside the interval [— M, M], we have that for all x € [— M, M\ 



j-M 

(g*R)(x)\ - 

/ g(t) [K 5o (x - t) - R(x - t)] dt 


J-M 


j-M 

< 

1 \g(t)\ \Ks 0 (x — t) — R(x — t)\ dt 

J 

-M 

<2 MB sup \K$ a (z) — R{z)\ 


ze[-2M,2M] 


< e/2. 


Therefore, the triangle inequality implies that \g(x) — (g * R)(x)\ <e 
whenever x € [— M, M ], hence | f(x) — (g * A)(x)| < e when x € [a, b\. 

Finally, note that g * R is a polynomial in the x variable. Indeed, by 
definition we have ( g * R){ x) = f M M g(t)R(x — t) dt, and R(x — t) is a 
polynomial in x since it can be expressed, after several expansions, as 
R.(x — t) = Y2, n a n {t)x n where the sum is finite. This concludes the proof 
of the theorem. 


2 Applications to some partial differential equations 

We mentioned earlier that a crucial property of the Fourier transform 
is that it interchanges differentiation and multiplication by polynomials. 
We now use this crucial fact together with the Fourier inversion theorem 
to solve some specific partial differential equations. 

2.1 The time-dependent heat equation on the real line 

In Chapter 4 we considered the heat equation on the circle. Here we 
study the analogous problem on the real line. 
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Consider an infinite rod, which we model by the real line, and suppose 
that we are given an initial temperature distribution f(x) on the rod 
at time t = 0. We wish now to determine the temperature u(x,t) at 
a point x at time t > 0. Considerations similar to the ones given in 
Chapter 1 show that when u is appropriately normalized, it solves the 
following partial differential equation: 


( 8 ) 


du d 2 u 
dt = fa 2 ’ 


called the heat equation. The initial condition we impose is 
u(x, 0) = f(x). 

Just as in the case of the circle, the solution is given in terms of a 
convolution. Indeed, define the heat kernel of the line by 


'Ht{x) = K$(x), with 5 = Airt, 

so that 

Wt(-T) = (4 Jty/ 2 e ~ x2/4t and ^ t(c) = 

Taking the Fourier transform of equation (8) in the x variable (for- 
mally) leads to 

= — 4 t T 2 i 2 u{i,t). 

Fixing £, this is an ordinary differential equation in the variable t (with 
unknown u(f, ■)), so there exists a constant A(f) so that 

u(£,t) = A(^)e~ 4n2 ^ 2t . 


We may also take the Fourier transform of the initial condition and obtain 
ff(£,0) = /(£), hence A(£) = /(C)- This leads to the following theorem. 

Theorem 2.1 Given f G <S(M), let 

u(x,t) — (/ *H t )(x) for t > 0 


where Tit is the heat kernel. Then: 

(i) The function u is C 2 when and t > 0, and u solves the heat 

equation. 


Ibookroot October 20, 2007 


2. Applications to some partial differential equations 


147 


(ii) u(x, t) — > f(x) uniformly in x as t — > 0. Hence if we set u(x, 0) = 
f(x), then u is continuous on the closure of the upper half-plane 

E^_ = {(x,t) '■ t> 0}. 

(iii) ff^ | u(x,t) — f(x)\ 2 dx ^ 0 as t — > 0. 

Proof. Because u = f * Tit, taking the Fourier transform in the x- 
variable gives u = fH t , and so u(f, t) = /(£)e _47r 5 t . The Fourier inver- 
sion formula gives 

/ °° 

f(0e~ 4n « e^de- 

-oo 

By differentiating under the integral sign, one verifies (i). In fact, one 
observes that u is indefinitely differentiable. Note that (ii) is an imme- 
diate consequence of Corollary 1.7. Finally, by Plancherel’s formula, we 
have 

/ °° r°° 

\u{x, t) - f(x) I 2 dx= K£, t) - f(f) I 2 df 
-oo J — oo 

/ oo 

\m 2 \e- in * -1 \d£. 

-oo 

To see that this last integral goes to 0 as t — > 0, we argue as follows: 
since \e~ 4n2t ^ — 1| < 2 and / € <S(M), we can find N so that 



1| df, < e, 


and for all small t we have sup^| <JV \f{f)\ 2 \e 4?r2i S 2 — 1| < e/2 N since / 
is bounded. Thus 



1| df < e for all small t. 


This completes the proof of the theorem. 


The above theorem guarantees the existence of a solution to the heat 
equation with initial data /. This solution is also unique, if uniqueness 
is formulated appropriately. In this regard, we note that u = f * hit, 
f G 5(E), satisfies the following additional property. 

Corollary 2.2 u(-,t ) belongs to 5(E) uniformly in t, in the sense that 
for any T > 0 


sup 

x e e 

0 < t < T 


d i 

dx e 


u(x, t) 


(9) 


< oo for each k,£> 0. 


Ibookroot October 20, 2007 


148 


Chapter 5. THE FOURIER TRANSFORM ON R 


Proof. This result is a consequence of the following estimate: 



< / \f(x-y)\n t (y)dy + 


M<M / 2 


M>M / 2 


\f(x - y)\H t {y)dy 



Indeed, since / is rapidly decreasing, we have | ffx — y)\ < Cn/{ 1 + 1^1)^ 
when \y\ < \x\/2. Also, if \y\ > \x\/2 then TLtfy) < Ct~ x / 2 e~ cx ^ /* , and we 
obtain the above inequality. Consequently, we see that u(x, t ) is rapidly 
decreasing uniformly for 0 < t < T. 

The same argument can be applied to the derivatives of u in the x 
variable since we may differentiate under the integral sign and apply the 
above estimate with / replaced by /', and so on. 

This leads to the following uniqueness theorem. 

Theorem 2.3 Suppose u(x,t ) satisfies the following conditions: 

(i) u is continuous on the closure of the upper half-plane. 

(ii) u satisfies the heat equation for t > 0. 

(iii) u satisfies the boundary condition u(x, 0) = 0. 

(iv) uf,t ) € <S(M) uniformly in t, as in (9). 

Then, we conclude that u = 0. 

Below we use the abbreviations d^u and d t u to denote d i u/dx t ' and 
du/dt, respectively. 

Proof. We define the energy at time t of the solution u(x, t ) by 



Clearly Eft) > 0. Since E{ 0) = 0 it suffices to show that E is a de- 
creasing function, and this is achieved by proving that dE/dt <0. The 
assumptions on u allow us to differentiate Eft) under the integral sign 


dE 

dt 


/ [dtufx, t)u(x , t) + u(x, t)dtu(x, t)] dx 

J R 


But u satisfies the heat equation, therefore d t u = dfu and d t u = dfu, so 
that after an integration by parts, where we use the fact that u and its 
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x derivatives decrease rapidly as |x| — > oo, we find 


dE 

dt 


/ [d 2 u(x,t)u(x,t) + u(x 1 t)d 2 u(x,t)] dx 

Jr 

— / [d x ii(x,t)d x u(x,t) + d x u(x,t)d x u(x,t)\ dx 

Jr 

—2 I \d x ii(x, t)\ 2 dx 

Jr 


< 0 , 


as claimed. Thus E{t) — 0 for all t, hence u — 0. 

Another uniqueness theorem for the heat equation, with a less restric- 
tive assumption than (9), can be found in Problem 6. Examples when 
uniqueness fails are given in Exercise 12 and Problem 4. 


2.2 The steady-state heat equation in the upper half-plane 

The equation we are now concerned with is 


( 10 ) 


d 2 u d 2 u 
U ~dx 2 + d^~ 


in the upper half-plane = {(x, y) : x € M, y > 0}. The boundary con- 
dition we require is u(x, 0) = f{x). The operator A is the Laplacian and 
the above partial differential equation describes the steady-state heat dis- 
tribution in subject to u — f on the boundary. The kernel that solves 
this problem is called the Poisson kernel for the upper half-plane, and 
is given by 


1 y 

V v {x) — where x e K. and y > 0. 

7 t x z + y z 

This is the analogue of the Poisson kernel for the disc discussed in Sec- 
tion 5.4 of Chapter 2. 

Note that for each fixed y the kernel V y is only of moderate decrease 
as a function of x , so we will use the theory of the Fourier transform 
appropriate for these types of functions (see Section 1.7). 

We proceed as in the case of the time-dependent heat equation, by 
taking the Fourier transform of equation (10) (formally) in the x variable, 
thereby obtaining 


-4tt 2 £ 2 u(£, y) + 


d 2 u 

dy 2 


(Z,V) = 0 
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with the boundary condition u(f, 0) = /'(C)- The general solution of this 
ordinary differential equation in y (with £ fixed) takes the form 

&(£,y) - A(f)e~ 2n ^ v + B{f)e 2n ^ v . 

If we disregard the second term because of its rapid exponential increase 
we find, after setting y — 0, that 


Therefore u is given in terms of the convolution of / with a kernel whose 
Fourier transform is e~ 2 ' K ^ v . This is precisely the Poisson kernel given 
above, as we prove next. 


Lemma 2.4 The following two identities hold: 

/ OO 

e~ 2 ^\ y e 2 ^ x df = V y ( X ), 

-OO 

/ OO 

V y (x)e- 2nix ^dx = e- 2n ^ v . 

-OO 


Proof. The first formula is fairly straightforward since we can split 
the integral from — oo to 0 and 0 to oo. Then, since y > 0 we have 


e -27r&, e 2 d . c _ 


J2vi{x+iy)^ ^ _ 


1 


e 2m(x+iy)i 

2ni(x + iy ) 


2ni(x + iy) 


and similarly, 


r 0 1 

/ e 2 ^ v e 2 df = 7 T . 

/-oo 2ni(x - iy) 


Therefore 


; -2vr|g|t/ e 27rifa; _ 


2ni(x — iy) 2iri(x + iy) i r(x 2 + y 2 ) 


The second formula is now a consequence of the Fourier inversion theorem 
applied in the case when / and f are of moderate decrease. 


Lemma 2.5 The Poisson kernel is a good kernel on IR. as y — > 0. 
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Proof. Setting £ = 0 in the second formula of the lemma shows that 
fZ 'Py(x) dx — 1, and clearly V y (x) > 0, so it remains to check the last 
property of good kernels. Given a fixed S > 0, we may change variables 
u = x/y so that 



r°° du 

IS/y 1 + U 2 


[arctanu]^ = 7r/2 — arctan (5/y), 


and this quantity goes to 0 as y — > 0. Since V y {x) is an even function, 
the proof is complete. 

The following theorem establishes the existence of a solution to our 
problem. 

Theorem 2.6 Given f £ S( M), let u(x,y) — (/ *V y ){x). Then: 

(i) u(x,y) is C 2 in and A u = 0. 

(ii) u(x,y ) — > f(x) uniformly as y — > 0. 

(iii) fZ \u(x,y) - f(x)\ 2 dx -> 0 as y -> 0. 

(iv) If u(x, 0) = f{x), then u is continuous on the closure of the 
upper half-plane, and vanishes at infinity in the sense that 


u(x, y) — > 0 as |x| + y — > oo. 


Proof. The proofs of parts (i), (ii), and (iii) are similar to the case of 
the heat equation, and so are left to the reader. Part (iv) is a consequence 
of two easy estimates whenever / is of moderate decrease. First, we have 


\(f*Vy)(x)\<C 


1 


+ 


(1 + x 2 ) x 2 + y 


which is proved (as in the case of the heat equation) by splitting the 
integral fZ f( x ~ 0 A W dt into the part where |i| < \x\/2 and the part 
where \t\ > \x\/2. Also, we have |(/ * V y ){x)\ < C/y, since sup x V y (x) < 

c/y- 

Using the first estimate when \x\ > \y\ and the second when |x| < \y\ 
gives the desired decrease at infinity. 

We next show that the solution is essentially unique. 

Theorem 2.7 Suppose u is continuous on the closure of the upper half- 
plane satisfies A u — 0 for ( x , y) £ IR^_, u(x, 0) = 0, and u(x, y) van- 

ishes at infinity. Then u— 0. 
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A simple example shows that a condition concerning the decay of u at 
infinity is needed: take u(x,y) = y. Clearly u satisfies the steady-state 
heat equation and vanishes on the real line, yet u is not identically zero. 

The proof of the theorem relies on a basic fact about harmonic func- 
tions, which are functions satisfying A u = 0. The fact is that the value 
of a harmonic function at a point equals its average value around any 
circle centered at that point. 

Lemma 2.8 (Mean-value property) Suppose is an open set in K 2 
and let u be a function of class C 2 with A u = 0 in 0. If the closure of 
the disc centered at ( x , y ) and of radius R is contained in Q., then 

1 [ 27T 

u(x,y) — —— / u{x + r cos 9, y + r sin#) dQ 
27t Jo 

for all 0 < r < R. 


Proof. Let U(r, 9) = u(x + r cos 9, y + rsin 9). Expressing the Lapla- 
cian in polar coordinates, the equation A u — 0 then implies 


0 = 


d 2 U d 

~W + r ~ih 



If we define F(r) — / Q 27r U(r, 9) d9, the above gives 


d ( dF\ 
T ~dr V ¥ ) 


1 r 2n 

27T Jo 


d 2 U 

~W 


(r, 9) d9. 


The integral of d 2 U/d9 2 over the circle vanishes since dU/d9 is peri- 
odic, hence (r^) = 0, and consequently rdF/dr must be constant. 
Evaluating this expression at r = 0 we find that dF/dr — 0. Thus F is 
constant, but since F( 0) = u(x, y), we finally find that F(r) = u(x, y) for 
all 0 < r < R, which is the mean-value property. 

Finally, note that the argument above is implicit in the proof of The- 
orem 5.7, Chapter 2. 

To prove Theorem 2.7 we argue by contradiction. Considering sepa- 
rately the real and imaginary parts of u, we may suppose that u itself 
is real-valued, and is somewhere strictly positive, say u(xo,yo) > 0 for 
some xo G E and yo > 0. We shall see that this leads to a contradiction. 
First, since u vanishes at infinity, we can find a large semi-disc of ra- 
dius R, = {(x, y) : x 2 + y 2 < R, y > 0} outside of which u(x, y) < 
A«(xq, yo). Next, since u is continuous in D^, it attains its maximum 
M there, so there exists a point (aq, y\) € D ^ with u{x\,y{) = M, while 
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u(x, y) < M in the semi-disc; also, since u(x, y) < \u(x o, yo) < M/2 out- 
side of the semi-disc, we have u(x, y) < M throughout the entire upper 
half-plane. Now the mean- value property for harmonic functions implies 

1 f 27T 

u(xi,yi) = — / u(xi + p cos 6,yi + p sin 6) d6 
Jo 

whenever the circle of integration lies in the upper half-plane. In par- 
ticular, this equation holds if 0 < p < y\. Since u{x\,yi) equals the 
maximum value M, and u{x\ + pcos8,yi + p sin#) < M, it follows by 
continuity that u(x i + pcos6,yi + psin6) = M on the whole circle. For 
otherwise u(x,y) < M — e, on an arc of length 5 > 0 on the circle, and 
this would give 

1 r 2n / , eS 

— / u(xi + pcosO, y i + p sin 6) dd < M — — < M , 

27T J 0 2-7T 

contradicting the fact that u(x\, y\) — M. Now letting p — > j/i, and using 
the continuity of u again, we see that this implies u(x i , 0) = M > 0, 
which contradicts the fact that u(x, 0) = 0 for all x. 


3 The Poisson summation formula 

The definition of the Fourier transform was motivated by the desire for 
a continuous version of Fourier series, applicable to functions defined 
on the real line. We now show that there exists a further remarkable 
connection between the analysis of functions on the circle and related 
functions on R. 

Given a function / 6 5(M) on the real line, we can construct a new 
function on the circle by the recipe 

OO 

Fi(x) = f( x + n )- 

n =— oo 

Since / is rapidly decreasing, the series converges absolutely and uni- 
formly on every compact subset of R, so F\ is continuous. Note that 
F\{x + 1) = F\(x) because passage from n to n + 1 in the above sum 
merely shifts the terms on the series defining F\ ( x ) . Hence F\ is periodic 
with period 1. The function F\ is called the periodization of /. 

There is another way to arrive at a “periodic version” of /, this time 
by Fourier analysis. Start with the identity 

poo 

f(x)= / /(?)e 2 ” €x ^, 


— OO 
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and consider its discrete analogue, where the integral is replaced by a 
sum 

OO 

F 2 (x)= J2 /(«>“• 


Once again, the sum converges absolutely and uniformly since f belongs 
to the Schwartz space, hence F 2 is continuous. Moreover, F 2 is also 
periodic of period 1 since this is the case for each one of the exponentials 

g2ninx 

The fundamental fact is that these two approaches, which produce F\ 
and F 2 . actually lead to the same function. 

Theorem 3.1 (Poisson summation formula) If f 6 <S(R), then 

OO OO 

E /(* + «) = E /We 2 ™"- 


In particular, setting x = 0 we have 


OO OO 

E /w = H /W- 

n =— oo n =— oo 


In other words, the Fourier coefficients of the periodization of / are 
given precisely by the values of the Fourier transform of / on the integers. 

Proof. To check the first formula it suffices, by Theorem 2.1 in 
Chapter 2, to show that both sides (which are continuous) have the 
same Fourier coefficients (viewed as functions on the circle). Clearly, the 
m th Fourier coefficient of the right-hand side is f{m). For the left-hand 
side we have 



oo r i 


E / f( x+n ) e 


— 2n imx 


dx 


n=— oo 

00 rn -\- 1 


^ rn- 1-1 

E / f(y)e~ 2nimv dy 

_ ^ J n 


n=— oo 
r oo 


f {y) e ~ 2nimv dy 


f(™), 


where the interchange of the sum and integral is permissible since / is 
rapidly decreasing. This completes the proof of the theorem. 
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We observe that the theorem extends to the case when we merely 
assume that both f and / are of moderate decrease; the proof is in fact 
unchanged. 

It turns out that the operation of periodization is important in a 
number of questions, even when the Poisson summation formula does 
not apply. We give an example by considering the elementary function 
f(x) = 1/x, x 7 ^ 0. The result is that Y^=-oo l/(ai + n), when summed 
symmetrically, gives the partial fraction decomposition of the cotangent 
function. In fact this sum equals 7 rcot 7 rx, when x is not an integer. 
Similarly with f(x) = 1/x 2 , we get Y^=-oo V( x + n ) 2 — 7 r 2 /(sin 7 rx) 2 , 
whenever x (/ Z (see Exercise 15). 


3.1 Theta and zeta functions 

We define the theta function i ?(s) for s > 0 by 

OO 

d(s) = e~™ 2s - 

n =— oo 

The condition on s ensures the absolute convergence of the series. A 
crucial fact about this special function is that it satisfies the following 
functional equation. 

Theorem 3.2 s - 1 / 2 i?(l/s) = 1 9(s) whenever s > 0. 

The proof of this identity consists of a simple application of the Poisson 
summation formula to the pair 

f{x) = e~ nsx2 and /(£) = S - 1/2 e~^ 2/s . 


The theta function i?(s) also extends to complex values of s when 
Re(s) > 0, and the functional equation is still valid then. The theta 
function is intimately connected with an important function in number 
theory, the zeta function ((s) defined for Re(s) > 1 by 


OO 

CM = E ^ 


n= 1 


Later we will see that this function carries essential information about 
the prime numbers (see Chapter 8 ). 

It also turns out that £, i?, and another important function P are 
related by the following identity: 

1 C°° 

7T“ S ' /2 r(s/2)C(s) = - 1 ) dt, 
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which is valid for s > 1 (Exercises 17 and 18). 

Returning to the function i?, define the generalization 0(z|r) given by 


0(z|t) 


E 


giirn 2 r ^2irinz 


n =— oo 


whenever Im(r) > 0 and z G C. Taking z = 0 and r = is we get Q(z\r) = 
■d(s). 

3.2 Heat kernels 

Another application related to the Poisson summation formula and the 
theta function is the time-dependent heat equation on the circle. A 
solution to the equation 

du d 2 u 
dt dx 2 

subject to u(x, 0) = f(x), where / is periodic of period 1, was given in 
the previous chapter by 


u(x,t) = (f*H t )(x) 

where H t (x) is the heat kernel on the circle, that is, 

OO 

H t (x)= e~^ 2n2t e 2ninx . 

n =— oo 


Note in particular that with our definition of the generalized theta func- 
tion in the previous section, we have 0(x|47 Tit) = H t (x). Also, recall that 
the heat equation on M gave rise to the heat kernel 


7i t (x) 


1 e -* 2 /4t 

(Ant) 1 / 2 


where Ht(0 — e 47r “5“ t . The fundamental relation between these two 
objects is an immediate consequence of the Poisson summation formula: 

Theorem 3.3 The heat kernel on the circle is the periodization of the 
heat kernel on the real line: 


OO 

Ht(x)= Y fttix + n). 


n =— oo 
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Although the proof that Tit is a good kernel on R was fairly straightfor- 
ward, we left open the harder problem that Ht is a good kernel on the 
circle. The above results allow us to resolve this matter. 

Corollary 3.4 The kernel Ht(x) is a good kernel for t — > 0. 

Proof. We already observed that /| x |<i/ 2 Ht(x) dx — 1. Now note 
that H t > 0, which is immediate from the above formula since Tit > 0. 
Finally, we claim that when x < 1/2, 

H t (x) = H t (x) + £ t (x), 

where the error satisfies \£t{x)\ < cie _C2 /* with ci,c 2 > 0 and 0 < t < 1. 
To see this, note again that the formula in the theorem gives 

H t (x) = Titfx ) + ^ Titfx + n); 

|n|>l 


therefore, since \x\ < 1/2, 


S t (x) = -j= Y e“^ +n ) ' u < Ct ~ 1/2 
| n |> 1 


Y e~ m jt - 

n> 1 


Note that n 2 /t > n 2 and n 2 /t > 1 jt whenever 0 < t < 1, so e c ” 2 / t < 
e ~ 2 n e~ 2 i. Hence 

\£ t (x)\ < Ct~ x / 2 e "St ^ e-5" 2 < cie” 02 /*. 

n>l 


The proof of the claim is complete, and as a result l /'|- c -| <1 / 2 \^t{x)\ dx — > 0 
as t — > 0. It is now clear that H t satisfies 


/?7<|a:|<l/2 


\H t (x)\ dx —> 0 as t — > 0, 


because Tit does. 


3.3 Poisson kernels 


In a similar manner to the discussion above about the heat kernels, we 
state the relation between the Poisson kernels for the disc and the upper 
half-plane where 


P r (0) - 


i y 

7 r y 2 + x 2 


1 — 2 r cos 6 + r 2 


and V y {x) 
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Theorem 3.5 P r (2nx) = J2 n ez'^’v( x + n ) where r = e~ 2ny . 

This is again an immediate corollary of the Poisson summation formula 
applied to f(x) = V y {x) and /(£) = e~ 2n ^ v . Of course, here we use the 
Poisson summation formula under the assumptions that / and / are of 
moderate decrease. 

4 The Heisenberg uncertainty principle 

The mathematical thrust of the principle can be formulated in terms of a 
relation between a function and its Fourier transform. The basic under- 
lying law, formulated in its vaguest and most general form, states that a 
function and its Fourier transform cannot both be essentially localized. 
Somewhat more precisely, if the “preponderance” of the mass of a func- 
tion is concentrated in an interval of length L, then the preponderance 
of the mass of its Fourier transform cannot lie in an interval of length 
essentially smaller than L -1 . The exact statement is as follows. 

Theorem 4.1 Suppose ip is a function in <S(M) which satisfies the nor- 
malizing condition \ip(x)\ 2 dx — 1. Then 

{ij‘\' Kx) ' ?dx ) (. Lj a leb* 

and equality holds if and only if ip{x) = Ae~ Bx where B > 0 and \A\ 2 — 
In fact, we have 

(y & ~ x 0 ) 2 \ip(x)\ 2 dx^j (y (£-&) 2 i$(oi 2d s) ^ ^2 

for every Xo;£o € 

Proof. The second inequality actually follows from the first by re- 
placing ip(x) by e~ 2mx ^°ip(x + xo ) and changing variables. To prove the 
first inequality, we argue as follows. Beginning with our normalizing as- 
sumption f \ip\ 2 = 1, and recalling that ip and ip' are rapidly decreasing, 
an integration by parts gives 
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The last identity follows because \ip\ 2 — ipip. Therefore 



where we have used the Cauchy-Schwarz inequality. The identity 

/ oo roo 

W{ x )\ 2 dx — 47t 2 / C 2 |^)| 2 ^, 

-oo J — oo 

which holds because of the properties of the Fourier transform and the 
Plancherel formula, concludes the proof of the inequality in the theorem. 

If equality holds, then we must also have equality where we applied the 
Cauchy-Schwarz inequality, and as a result we find that i/j'(x) = /3xip(x) 
for some constant (3. The solutions to this equation are ip(x) — Ae^ x Z 2 , 
where A is constant. Since we want ip to be a Schwartz function, we must 
take fd = —2 B < 0, and since we impose the condition \tp(x)\ 2 dx — 1 
we find that |A| 2 = y 7 2B/n , as was to be shown. 

The precise assertion contained in Theorem 4.1 first came to light in 
the study of quantum mechanics. It arose when one considered the extent 
to which one could simultaneously locate the position and momentum of 
a particle. Assuming we are dealing with (say) an electron that travels 
along the real line, then according to the laws of physics, matters are 
governed by a “state function” ip, which we can assume to be in 5(M), 
and which is normalized according to the requirement that 

/ OO 

\ip(x)\ 2 dx - 1. 

-OO 

The position of the particle is then determined not as a definite point x\ 
instead its probable location is given by the rules of quantum mechanics 
as follows: 

• The probability that the particle is located in the interval (a, b ) is 
f*\ip(x)\ 2 dx. 

According to this law we can calculate the probable location of the 
particle with the aid of ip: in fact, there may be only a small probability 
that the particle is located in a given interval (a' ,b'), but nevertheless it 
is somewhere on the real line since fp° \ip(x)\ 2 dx = 1. 
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In addition to the probability density \ip(x)\ 2 dx, there is the ex- 
pectation of where the particle might be. This expectation is the best 
guess of the position of the particle, given its probability distribution 
determined by \i/j(x)\ 2 dx, and is the quantity defined by 

/ OO 

x\ip(x)\ 2 dx. 

-OO 

Why is this our best guess? Consider the simpler (idealized) situation 
where we are given that the particle can be found at only finitely many 
different points, aq, £ 2 , • • • , aw on the real axis, with yq the probability 
that the particle is at aq, and yq + P 2 + • • • + Pn = 1- Then, if we knew 
nothing else, and were forced to make one choice as to the position of the 
particle, we would naturally take x — JfiLi x iPi> which is the appropriate 
weighted average of the possible positions. The quantity (12) is clearly 
the general (integral) version of this. 

We next come to the notion of variance, which in our terminology is 
the uncertainty attached to our expectation. Having determined that 
the expected position of the particle is x (given by (12)), the resulting 
uncertainty is the quantity 

/ OO 

(x — x) 2 \ip(x)\ 2 dx. 

-OO 

Notice that if ip is highly concentrated near x, it means that there is a 
high probability that x is near x, and so (13) is small, because most of 
the contribution to the integral takes place for values of x near x. Here 
we have a small uncertainty. On the other hand, if ip ( x ) is rather flat 
(that is, the probability distribution \ip(x)\ 2 dx is not very concentrated), 
then the integral (13) is rather big, because large values of (x — x) 2 will 
come into play, and as a result the uncertainty is relatively large. 

It is also worthwhile to observe that the expectation x is that choice 
for which the uncertainty /^(x — x) 2 \ip(x)\ 2 dx is the smallest. Indeed, 
if we try to minimize this quantity by equating to 0 its derivative with 
respect to x. we find that 2 /^(x — x)\ip(x)\ 2 dx = 0, which gives (12). 

So far, we have discussed the “expectation” and “uncertainty” related 
to the position of the particle. Of equal relevance are the corresponding 
notions regarding its momentum. The corresponding rule of quantum 
mechanics is: 

• The probability that the momentum £ of the particle belongs to 
the interval ( a,b ) is |^>(£)| 2 where $ is the Fourier transform 
of ip. 
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Combining these two laws with Theorem 4.1 gives 1/167T 2 as the lower 
bound for the product of the uncertainty of the position and the uncer- 
tainty of the momentum of a particle. So the more certain we are about 
the location of the particle, the less certain we can be about its mo- 
mentum, and vice versa. However, we have simplified the statement of 
the two laws by rescaling to change the units of measurement. Actually, 
there enters a fundamental but small physical number h called Planck’s 
constant. When properly taken into account, the physical conclusion is 

(uncertainty of position) x (uncertainty of momentum) > h/16n 2 . 


5 Exercises 


1. Corollary 2.3 in Chapter 2 leads to the following simplified version of the 
Fourier inversion formula. Suppose / is a continuous function supported on an 
interval [— M, M], whose Fourier transform / is of moderate decrease. 

(a) Fix L with L/2 > M, and show that f{x) = Y a n (L)e 27rmx / L where 

r L/2 
I -L/2 

Alternatively, we may write f(x) = S Y^/L-oo / (n5)e 2 ' KlnSx with 5 = 1/L. 

(b) Prove that if F is continuous and of moderate decrease, then 


1 r L / 2 i 

a„(L) = T / f(x)e~ 2 ” nx ' L dx = jf(n/L). 
L J-L/2 L 


/ oo x 

F(£)d£ = lim 5 F(5n). 

'OO ! Tn ^ ^ 


> o n =— oo 


(c) Conclude that f(x) = / 


[Hint: For (a), note that the Fourier series of / on [— L/2, L/2] converges ab- 

rN 

solutely. For (b), first approximate the integral by J_ N F and the sum by 
6 Y\ n \<N/s F( n 5). Then approximate the second integral by Riemann sums.] 


2. Let / and g be the functions dehned by 

and g(x) = 


f(x) = X'j-i.i] (x) = { J 


1 if |a:| < 1, 


otherwise, 


1 — ja:| if |a:| < 1, 


0 


otherwise. 


Although / is not continuous, the integral defining its Fourier transform still 
makes sense. Show that 


f(0 = and 9(0 = 


sin 7r^ 
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with the understanding that /( 0) = 2 and g(0) = 1. 

3. The following exercise illustrates the principle that the decay of / is related 
to the continuity properties of /. 

(a) Suppose that / is a function of moderate decrease on R whose Fourier 
transform / is continuous and satisfies 

f(0 = O as |f| -> OO 

for some 0 < a < 1. Prove that / satisfies a Holder condition of order a, 
that is, that 

| f(x + h) — f(x ) | < M\h\ a for some M > 0 and all x,h & R. 

(b) Let / be a continuous function on R which vanishes for \x\ > 1, with 
/( 0) = 0, and which is equal to 1/ log(l/|x|) for all x in a neighborhood 
of the origin. Prove that / is not of moderate decrease. In fact, there is 
no e > 0 so that /(f) = 0(l/|f| 1+e ) as |f| — > oo. 

[Hint: For part (a), use the Fourier inversion formula to express f(x + h) — /( x) 
as an integral involving /, and estimate this integral separately for f in the two 
ranges |f| < l/|/i| and |f| > l/|/i|.] 

4. Bump functions. Examples of compactly supported functions in 5(R) are 
very handy in many applications in analysis. Some examples are: 

(a) Suppose a < b, and / is the function such that f(x) = 0 if x < a or x > b 
and 

/( x) = e-V^-^e-i Kb-x) if a <x<b. 

Show that / is indefinitely differentiable on R. 

(b) Prove that there exists an indefinitely differentiable function F on R such 
that F(x) = 0 if x < a, F(x ) = 1 if x > b, and F is strictly increasing on 
[a,b], 

(c) Let 5 > 0 be so small that a + S < b — 5. Show that there exists an indef- 
initely differentiable function g such that g is 0 if x < a or x > b, g is 1 on 
[o + 5, b — <5] , and g is strictly monotonic on [a, a + <5] and [b — d, b]. 

[Hint: For (b) consider F(x) = c J f(t ) dt where c is an appropriate constant.] 

5. Suppose / is continuous and of moderate decrease. 

(a) Prove that / is continuous and /(f) — > 0 as |f| — * oo. 
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(b) Show that if /(£) = 0 for all £, then / is identically 0. 

[Hint: For part (a), show that /(£) = | fT^[f( x ) ~ fi x ~ l/(2£))]e~ 2,r ' x £ dx. 
For part (b), verify that the multiplication formula J f(x)g(x ) dx = f f(y)g(y ) dy 
still holds whenever g £ iS(R).] 

2 

6. The function e~ nx is its own Fourier transform. Generate other functions 
that (up to a constant multiple) are their own Fourier transforms. What must 
the constant multiples be? To decide this, prove that J- A = I. Here T{ f ) = / 
is the Fourier transform, = ToToToT , and I is the identity operator 
(//)( x) = f(x) (see also Problem 7). 

7 . Prove that the convolution of two functions of moderate decrease is a function 
of moderate decrease. 

[Hint: Write 


/ f(x - y)g(y) dy = / + 

J J\y\<\x\/2 J\y\>\x\/2 

In the first integral f(x — y) = 0(1/(1 + x 2 )) while in the second integral 
g(y) = 0( 1/(1 + Z 2 )).] 

8. Prove that / is continuous, of moderate decrease, and f (y)e~ v ~ e 2xy dy =0 
for all itR, then / = 0. 

[Hint: Consider / * e _x2 .[ 


9. If / is of moderate decrease, then 

(14) (l - ^ /(Oe 2 ™ f # = (/ * ?r)(x), 


where the Fejer kernel on the real line is dehned by 


( sin 7r tR \ 2 

R y^tir) lf ^ 0, 

R if t = 0. 


Show that {Tr} is a family of good kernels as R — > oo, and therefore (14) tends 
uniformly to f{x) as R — > oo. This is the analogue of Fejer’s theorem for Fourier 
series in the context of the Fourier transform. 


10 . Below is an outline of a different proof of the Weierstrass approximation 
theorem. 
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Define the Landau kernels by 


-bn(*) 


(i - x 2 y 


if — 1 < x < 1, 
if |*| > 1, 


where c n is chosen so that L n (x) dx = 1. Prove that {L n } n > 0 is a family 
of good kernels as n — » oo. As a result, show that if / is a continuous func- 
tion supported in [—1/2, 1/2], then (/ * L n ){x) is a sequence of polynomials on 
[—1/2, 1/2] which converges uniformly to /. 

[Hint: First show that c n > 2/(n + 1).] 


11. Suppose that u is the solution to the heat equation given by u = f * Tit 
where / 6 5(11). If we also set u(x, 0) = fix), prove that u is continuous on the 
closure of the upper half-plane, and vanishes at infinity, that is, 

u(x, t) — > 0 as |*| + t — > oo. 

[Hint: To prove that u vanishes at infinity, show that (i) | u(x, t)| < C/yi and (ii) 
\u(x,t)\ < (7/(1 + |*| 2 ) + (7t _1 / 2 e _CT "/ t . Use (i) when |*| < t, and (ii) other- 
wise.] 


12 . Show that the function defined by 

X 

u(x,t) = -H t {x) 

satisfies the heat equation for t > 0 and lim t ^ 0 u(x, t) = 0 for every x, but u is 
not continuous at the origin. 

[Hint: Approach the origin with (x,t) on the parabola * 2 /4f = c where c is a 
constant.] 


13 . Prove the following uniqueness theorem for harmonic functions in the strip 
{(*, y) : 0 < y < 1, — oo < x < oo}: if u is harmonic in the strip, continuous on 
its closure with u(x, 0) = u(x, 1) = 0 for all icR, and u vanishes at infinity, 
then u = 0. 


14 . Prove that the periodization of the Fejer kernel J-j\i on the real line (Exer- 
cise 9) is equal to the Fejer kernel for periodic functions of period 1. In other 
words, 

OO 

y T N (x + n) = F n (x), 

n= — o o 


when IV > 1 is an integer, and where 


F n (x) = 


N 

E 

n=—N 


\n\ 
1 “ ~N 


1 sin 2 (A r 7r*) 
N sin 2 (7ra:) 
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15. This exercise provides another example of periodization. 

(a) Apply the Poisson summation formula to the function g in Exercise 2 to 
obtain 

00 i 2 

E l 7T 

(n + a) 2 (sin7ra) 2 

n= — oo 

whenever a is real, but not equal to an integer. 

(b) Prove as a consequence that 


(n + a) tan n a 

n= — oo 

whenever a is real but not equal to an integer. [Hint: First prove it when 
0 < a < 1. To do so, integrate the formula in (b). What is the precise 
meaning of the series on the left-hand side of (15)? Evaluate at a = 1/2.] 


( 15 ) E 


16. The Dirichlet kernel on the real line is defined by 

r R 


' -R 


f(0 e ir,a * d£ = (f* V R ){x) so that V R {x) = X[-r,r](x) = 


sin(27r Rx) 


Also, the modified Dirichlet kernel for periodic functions of period 1 is defined 
by 


d *n(x) = E e2 

\n\<N-l 


+ 2 (e 


— 2tviNx _|_ ^iriNx'j 


Show that the result in Exercise 15 gives 


OO 

E ^JvC^ + n) = D* n (x), 

n=— oo 

where N > 1 is an integer, and the infinite series must be summed symmetrically. 
In other words, the periodization of Hat is the modified Dirichlet kernel D^. 


17. The gamma function is defined for s > 0 by 


T(s) 


^x 3 - 1 dx. 


(a) Show that for s > 0 the above integral makes sense, that is, that the 
following two limits exist: 


Inn 

5 0 

6 > 0 



^—x , 5—1 

e x 


dx 


and 


r A 

lim / e~ x x 3 ~ 1 dx. 
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(b) Prove that T(s + 1) = sr(s) whenever s > 0, and conclude that for every 
integer n > 1 we have T(n + 1) = n\. 

(c) Show that 

r (0 = ^ “ d 
[Hint: For (c), use e -7 ™ 2 dx = 1.] 

18. The zeta function is defined for s > 1 by (( s ) = 1 /n s . Verify the 

identity 

1 f°° 

7i-- s / 2 r(s/2)C(s) = - / 1 5_1 (tl(<) — 1) dt whenever s > 1 

2 Jo 

where T and d are the gamma and theta functions, respectively: 


F(s) 


dt 


OO 

and d(s)= e-™* 3 . 

n= — oo 


More about the zeta function and its relation to the prime number theorem can 
be found in Book II. 


19. The following is a variant of the calculation of £(2m) = YJ,T=i l/^ 2m found 
in Problem 4, Chapter 3. 

(a) Apply the Poisson summation formula to /( x) = t/{n{x 2 + 1 2 )) 
and /(£) = e~ 2nt ^ where t > 0 in order to get 

oc 

1 = V e ~ 2lvt M 

t 2 + n 2 ^ 

n= — oo n =— oo 



(b) Prove the following identity valid for 0 < t < 1: 



7T £2 n 2 


n =— oo 


i o ^ 

— + A V(-l) m+1 C(2m)t 2m - 1 

TVt 7T 

771— 1 


as well as 


E 


27 ri |n| 


2 


- 1. 


n= — oc 


1 - e _27rt 
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(c) Use the fact that 


0 — = 1 - - + V 
- 1 2 ^ 


z 

e* — 1 


B 


2m 2m 


2 ' ^ (2m)! 

m= 1 V ' 

where B are the Bernoulli numbers to deduce from the above formula, 

m+ i (2^) 2m 


2C(2m) = (-1) 


(2m) l 


Bo 


20. The following results are relevant in information theory when one tries to 
recover a signal from its samples. 

Suppose / is of moderate decrease and that its Fourier transform / is sup- 
ported in I = [—1/2, 1/2]. Then, / is entirely determined by its restriction to 
Z. This means that if g is another function of moderate decrease whose Fourier 
transform is supported in I and f(n) = g(n ) for all n G Z, then / = g. More 
precisely: 

(a) Prove that the following reconstruction formula holds: 


OO 

f( x )= f( n ) K ( x ~ n ) 

n= — oo 


where K(y ) 


sin ny 
ny 


Note that K(y) = 0(l/\y\) as \y\ —> oo. 
(b) If A > 1, then 


/(*) 




where K\(y) 


cos ny — cos nXy 
tt 2 (A - 1 )y 2 


Thus, if one samples / “more often,” the series in the reconstruction 
formula converges faster since K\(y) = 0(l/\y\ 2 ) as | 2 /| — * oo. Note that 
K\(y) -» K(y) as A — ► 1. 


(c) Prove that 


\f(x)\ 2 dx 


c>o 

E i«">t 

n= — oo 


[Hint: For part (a) show that if \ is the characteristic function of I, then 
/(£) = x(£) f(n)e~ 2mn ^. For (b) use the function in Figure 2 instead 

of x(£)-] 

21. Suppose that / is continuous on R. Show that / and / cannot both be 
compactly supported unless / = 0. This can be viewed in the same spirit as the 
uncertainty principle. 
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[Hint: Assume / is supported in [0,1/2]. Expand / in a Fourier series in the 
interval [0, 1], and note that as a result, / is a trigonometric polynomial.] 


22. The heuristic assertion stated before Theorem 4.1 can be made precise as 
follows. If F is a function on R, then we say that the preponderance of its mass 
is contained in an interval I (centered at the origin) if 


(16) 


x 2 \F(x)\ 2 dx > - 


x 2 \F[x)\ 2 dx. 


Now suppose / 6 5, and (16) holds with F = f and I = I\; also with F = / and 
I = / 2 . Then if Lj denotes the length of Ij , we have 


A similar conclusion holds if the intervals are not necessarily centered at the 
origin. 


23. The Heisenberg uncertainty principle can be formulated in terms of the 

,2 q 

operator L = — + ar , which acts on Schwartz functions by the formula 


Hf) 


dx 2 


+ x 2 f. 


This operator, sometimes called the Hermite operator, is the quantum ana- 
logue of the harmonic oscillator. Consider the usual inner product on S given 
by 


if, 9) 


f(x)g(x) dx 


whenever f,g e S. 
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(a) Prove that the Heisenberg uncertainty principle implies 

(Lf, /)>(/,/) for all fe S. 

This is usually denoted by L > I . [Hint: Integrate by parts.] 

(b) Consider the operators A and A* defined on S by 

A{f) = l +Xf ^ ^ (/) = “l +X/ - 

The operators A and A* are sometimes called the annihilation and cre- 
ation operators, respectively. Prove that for all f,g 6 S we have 

(i) (Af,g) = (f,A*g), 

(ii) {Af,Af) = (A*Af,f)> 0, 

(iii) A* A = L — I. 

In particular, this again shows that L > I . 

(c) Now for t £ R, let 

A{f) = +txf and A*(f) = -j-+ txf. 

Use the fact that (A^A t f, /) > 0 to give another proof of the Heisenberg 
uncertainty principle which says that whenever J !/(a:)| 2 dx = 1 then 


x‘ 2 \f(x)\ 2 dx 




> 1/4. 


[Hint: Think of {A^A t f,f) as a quadratic polynomial in t.[ 


6 Problems 


1. The equation 
(17) 


2 d 2 u du du 
X dx* +aX dx = dt 


with u(x, 0) = f(x) for 0 < x < oc and t > 0 is a variant of the heat equation 
which occurs in a number of applications. To solve (17), make the change of vari- 
ables x = e~ v so that —oo<y<oo. Set U(y,t) = u(e~ y ,f) and 
F(y) = /(e _y )- Then the problem reduces to the equation 


d 2 U _ dU _ dU 
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with U(y, 0) = F(y). This can be solved like the usual heat equation (the case 
a — 1) by taking the Fourier transform in the y variable. One must then compute 
the integral e (-^ 2 Z 2 +A-a) 2 nit)t e 2 ni(v ^ Show that the solution of the 
original problem is then given by 


u(x, t ) = 


- 1 r p -( log(v/x)+(l-a)t) 2 /(4t) f ( \ ^ 

] (47 rt)V2 J 0 V) V ■ 


2. The Black-Scholes equation from finance theory is 


(18) 


dV dV a 2 s 2 d 2 V 
~dt +TS ~ds + ~2~ ~d^ 


— rV = 0, 


0 <t<T, 


subject to the “hnal” boundary condition V(s, T) = F(s). An appropriate change 
of variables reduces this to the equation in Problem 1. Alternatively, the substi- 
tution V(s,t) = e ax+br U(x,T) where x = logs, r = yr(T — t), a = \ — ^ , and 

b = — (| + yj) 2 reduces (18) to the one-dimensional heat equation with the ini- 
tial condition U(x,0) = e~ ax F(e x ). Thus a solution to the Black-Scholes equa- 
tion is 


V(s,t) = 


a~r{T-t ) 


ra 2 (T-t) 


’ (log(»/»*) + (r-,T- i /2)(T-t)) 2 

e F(s*)ds*. 


3. * The Dirichlet problem in a strip. Consider the equation A u = 0 in the 
horizontal strip 

{(x, y) : 0 < y < 1, —oo < x < oo} 

with boundary conditions u(x, 0) = fo(x) and u( x, 1) = fi(x), where /o and /i 
are both in the Schwartz space. 

(a) Show (formally) that if u is a solution to this problem, then 

u{^y) = A{£)e 2 ^y + B{i)e~ 2 ^ v ■ 


Express A and B in terms of f 0 and fi , and show that 




sinh(27r£) 


sinh(27r£) 


(b) Prove as a result that 


\u(x, y) — fo(x)\ 2 dx — >■ 0 as y — > 0 


and 


/ OO 

\u(x, y) — fi{x)\ 2 dx — > 0 as y — > 1. 

-OO 
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(c) If $(0 = (sinh27ra£)/(smh27r£), with 0 < a < 1, then $ is the Fourier 
transform of ip where 

sin 7r a 1 

<p(x) = • ; . 

2 cosh ttx + cos Tva 

This can be shown, for instance, by using contour integration and the 
residue formula from complex analysis (see Book II, Chapter 3). 

(d) Use this result to express u in terms of Poisson-like integrals involving /o 
and /i as follows: 

„(»,„> =^( r «*-*> dt+ r m*- { > 

2 \J-oo cosh nt — cos ny _/_ 00 cosh nt + cos Try 

(e) Finally, one can check that the function u(x, y) defined by the above ex- 
pression is harmonic in the strip, and converges uniformly to fo{x) as 
y — > 0, and to fi{x) as y — > 1. Moreover, one sees that u(x,y) vanishes at 
infinity, that is, lini|, I ,|^ (x) u(x,y) = 0, uniformly in y. 



In Exercise 12, we gave an example of a function that satisfies the heat equation 
in the upper half-plane, with boundary value 0, but which was not identically 0. 
We observed in this case that u was in fact not continuous up to the boundary. 

In Problem 4 we exhibit examples illustrating non-uniqueness, but this time with 
continuity up to the boundary t = 0. These examples satisfy a growth condition 
at infinity, namely \u(x,t)\ < Ce cx , for any t > 0. Problems 5 and 6 show 
that under the more restrictive growth condition | u(x, t)\ < Ce cx , uniqueness does 
hold. 


4.* If g is a smooth function on R, define the formal power series 

oo 2 n 

(! 9 ) u ( x ^ = Y,3 (n) ^T^Ty 

71=0 ’’ 

(a) Check formally that u solves the heat equation. 

(b) For a > 0, consider the function defined by 


9(t) 


e -t if t > 0 

0 if t < 0. 


One can show that there exists 0 < 0 < 1 depending on a so that 


g (k) (t ) I < 



for t > 0. 


(c) As a result, for each x and t the series (19) converges; u solves the heat 
equation; u vanishes for t = 0; and u satisfies the estimate |w(:r,t)| < 
Ce c M 2 “ /( “ 11 for some constants C, c > 0. 
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(d) Conclude that for every e > 0 there exists a non-zero solution to the heat 
equation which is continuous for X 6 R and t > 0, which satisfies u(x, 0) = 
0 and |u(x,f)| < C , e c W 2+ ' . 


5.* The following “maximum principle” for solutions of the heat equation will 
be used in the next problem. 

Theorem. Suppose that u(x, t) is a real-valued solution of the heat equation 
in the upper half-plane, which is continuous on its closure. Let R denote the 
rectangle 

R = {(x, y) G R 2 : a < x <b, 0 < t < c} 

and d'R be the part of the boundary of R which consists of the two vertical sides 
and its base on the line t = 0 ( see Figure 3). Then 

min u(x,t) = min u(x,t) and max u(x,t) = max u(x,t). 
(x,t)€:d' R (x,t)£R (x,t)€d'R (x,t)^R 



d'R 


Figure 3. The rectangle R and part of its boundary d'R 


The steps leading to a proof of this result are outlined below. 

(a) Show that it suffices to prove that if u > 0 on d' R, then u > 0 in R. 

(b) For e > 0, let 

v(x , t) = u(x , t) + et. 

Then, v has a minimum on R, say at (x\,t\). Show that X\ = a or b, 
or else ti = 0. To do so, suppose on the contrary that a < x± < b and 
0 < t± < c, and prove that v xx (x\,t\) — v t (xi,ti) < —e. However, show 
also that the left-hand side must be non-negative. 

(c) Deduce from (b) that u(x, t) > e(ti — t ) for any (x, t) £ R and let e — > 0. 


6.* The examples in Problem 4 are optimal in the sense of the following unique- 
ness theorem due to Tychonoff. 


Ibookroot October 20, 2007 


6. Problems 173 

Theorem. Suppose u(x , t ) satisfies the following conditions: 

(i) u(x,t) solves the heat equation for all x and and all t > 0. 

(ii) u(x , t ) is continuous for all x 6l and 0 < t < c. 

(iii) u(x, 0) = 0. 

(iv) \u(x,t)\ < Me ax2 for some M, a, and all x £ R, 0 < t < c. 

Then u is identically equal to 0. 


7.* The Hermite functions hk(x) are defined by the generating identity 

oo k 

^ =e-(* 2 / 2 - 2te+i2 ). 

k = 0 

(a) Show that an alternate definition of the Hermite functions is given by the 
formula 

[Hint: Write e -0 2 / 2 - 2te +* 2 ) = e P/ 2 e ~(x~t) 2 and use Taylor’s formula.] 
Conclude from the above expression that each hk(x) is of the form 
Pk(x)e~ x / 2 , where Pk is a polynomial of degree k. In particular, the Her- 
mite functions belong to the Schwartz space and h 0 (x) = e~ x / 2 , 
hi(x) = 2xe~ x2 !‘ 1 . 

(b) Prove that the family {hk}^L 0 is complete in the sense that if / is a 
Schwartz function, and 

/ OO 

f{x)hk{x) dx = 0 for all k > 0, 

-OO 

then / = 0. [Hint: Use Exercise 8.] 

(c) Define h* k (x ) = hk(( 2 tt) 1 /‘ 2 x). Then 

%) = 

Therefore, each h* k is an eigenfunction for the Fourier transform. 

(d) Show that h k is an eigenfunction for the operator defined in Exercise 23, 
and in fact, prove that 

Lhk = (2k + 1 )hk- 

In particular, we conclude that the functions hk are mutually orthogonal 
for the L 2 inner product on the Schwartz space. 

(e) Finally, show that f^° [hk(x)] 2 dx = 7r 1 / 2 2 fc fc!. [Hint: Square the generat- 
ing relation.] 
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8.* To refine the results in Chapter 4, and to prove that 


OO 

f a ( x) = ^22~ na e 2ni2nx 

n = 0 

is nowhere differentiable even in the case a = 1 , we need to consider a variant of 
the delayed means Ajv, which in turn will be analyzed by the Poisson summation 
formula. 


(a) Fix an indefinitely differentiable function 4> satisfying 


m 


1 when |£| < 1, 
0 when |£| > 2. 


By the Fourier inversion formula, there exists ip 6 S so that </>(£) = 4>(£). 
Let Pn{x) = Ntp(Nx) so that <^iv(C) = 4>(£/lV). Finally, set 


OO 

A N (x)= ^2 p N (x + n). 

n =— oo 


Observe by the Poisson summation formula that An{x) = 
Y^^=-oo &(n/N)e 2nlnx , thus Ajv is a trigonometric polynomial of degree 
< 21V, with terms whose coefficients are 1 when \n\ < N. Let 

A iv(/) = / * Ajv- 


Note that 

S N (f a ) = A N ,(f a ) 

where N' is the largest integer of the form 2 k with N' < N. 

(b) If we set An(x) = Pn(x) + E^{x) where 

E n (x) = ^ Pn(x + ti), 

\n\>l 

then one sees that: 


(i) sup| x |< 1/2 \E' n (x)\ -> 0 as N -> oo. 

(ii) \A' n (x)\<cN 2 . 

(iii) |A^ v (a:)| < c/(N\x\ z ), for \x\ < 1/2. 

Moreover, A' N (x) dx = 0, and — I\ x \<i /2 xA' N {x) dx — > 1 as N — > 

oo. 

(c) The above estimates imply that if f'{x o) exists, then 

(f * A' N )(x 0 + h N ) ^ f'(x 0 ) as IV ^ oo, 

whenever |/ijv| < C/N. Then, conclude that both the real and imaginary 
parts of fi are nowhere differentiable, as in the proof given in Section 3, 
Chapter 4. 
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It occurred to me that in order to improve treatment 
planning one had to know the distribution of the at- 
tenuation coefficient of tissues in the body. This in- 
formation would be useful for diagnostic purposes and 
would constitute a tomogram or series of tomograms. 

It was immediately evident that the problem was 
a mathematical one. If a fine beam of gamma rays 
of intensity 7o is incident on the body and the emerg- 
ing density is 7, then the measurable quantity g equals 
log(7o//) = J f ds, where / is the variable absorption 
coefficient along the line L. Hence if / is a function of 
two dimensions, and g is known for all lines intersect- 
ing the body, the question is, can / be determined if 
g is known? 

Fourteen years would elapse before I learned that 
Radon had solved this problem in 1917. 

A. M. Cormack, 1979 


The previous chapter introduced the theory of the Fourier transform 
on M and illustrated some of its applications to partial differential equa- 
tions. Here, our aim is to present an analogous theory for functions of 
several variables. 

After a brief review of some relevant notions in M d , we begin with some 
general facts about the Fourier transform on the Schwartz space <S(R d ). 
Fortunately, the main ideas and techniques have already been considered 
in the one-dimensional case. In fact, with the appropriate notation, the 
statements (and proofs) of the key theorems, such as the Fourier inversion 
and Plancherel formulas, remain unchanged. 

Next, we highlight the connection to some higher dimensional prob- 
lems in mathematical physics, and in particular we investigate the wave 
equation in d dimensions, with a detailed analysis in the cases d — 3 
and d — 2. At this stage, we discover a rich interplay between the Fourier 
transform and rotational symmetry, that arises only in K. d when d > 2. 

Finally, the chapter ends with a discussion of the Radon transform. 
This topic is of substantial interest in its own right, but in addition has 
significant relevance in its application to the use of X-ray scans as well 
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as to other parts of mathematics. 

1 Preliminaries 

The setting in this chapter will be the vector space 1 of all d-tuples of 
real numbers (aq, . . . , Xd) with Xi € M. Addition of vectors is component- 
wise, and so is multiplication by real scalars. Given x — {x \ , . . . , Xd) € 
we define 


l*l = (®i + -" + ®d) 1/2 , 

so that |x| is simply the length of the vector x in the usual Euclidean 
norm. In fact, we equip with the standard inner product defined by 


x-y = xiyi H 1- XdVd, 


so that \x\ 2 = x ■ x. We use the notation x-y in place of (x, y) of Chap- 
ter 3. 

Given a d-tuple a = (oq, . . . , ad) of non- negative integers (sometimes 
called a multi-index), the monomial x a is defined by 


Similarly, we define the differential operator (d/dx) a by 

( oy _( d \ ai / ay 2 / a \" d _ cH 

yds/ \dx2j dx^ 1 ■ ■ ■ dx^ d ’ 

where |a| = ap + • • * + is the order of the multi-index a. 

1.1 Symmetries 

Analysis in M d , and in particular the theory of the Fourier transform, is 
shaped by three important groups of symmetries of the underlying space: 

(i) Translations 

(ii) Dilations 

(iii) Rotations 


1 See Chapter 3 for a brief review of vector spaces and inner products. Here we find it 
convenient to use lower case letters such as x (as opposed to X) to designate points in 

Also, we use | • | instead of || • || to denote the Euclidean norm. 
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We have seen that translations x h- * x + h, with h E IR d hxed, and dila- 
tions x i— > Sx, with 5 > 0, play an important role in the one-dimensional 
theory. In IR, the only two rotations are the identity and multiplica- 
tion by —1. However, in M. d with d > 2 there are more rotations, and 
the understanding of the interaction between the Fourier transform and 
rotations leads to fruitful insights regarding spherical symmetries. 

A rotation in U. d is a linear transformation R : IR d — > IR d which pre- 
serves the inner product. In other words, 

• R(ax + by) = aR(x) + bR(y ) for all x, y € IR d and a, b 6 M. 

• R(x) ■ R(y) = x ■ y for all x, y € IR d . 

Equivalently, this last condition can be replaced by |i?(a:)| = \x\ for all 
x € IR d , or R* = i ? _1 where R l and R denote the transpose and inverse 
of R, respectively . 2 In particular, we have det(-R) = ±1, where det (R) is 
the determinant of R. If det (R) — 1 we say that R is a proper rotation; 
otherwise, we say that R is an improper rotation. 

Example 1. On the real line IR, there are two rotations: the identity 
which is proper, and multiplication by —1 which is improper. 

Example 2. The rotations in the plane M 2 can be described in terms of 
complex numbers. We identify K 2 with C by assigning the point (x, y) 
to the complex number z = x + iy. Under this identification, all proper 
rotations are of the form 2 ze lip for some i^Gl, and all improper rota- 
tions are of the form z i— > ze zv> for some ip € M (here, z — x — iy denotes 
the complex conjugate of z). See Exercise 1 for the argument leading to 
this result. 

Example 3. Euler gave the following very simple geometric description 
of rotations in IR 3 . Given a proper rotation R, there exists a unit vector 
7 so that: 

(i) R fixes 7 , that is, R( 7 ) = 7 . 

(ii) If V denotes the plane passing through the origin and perpendicular 
to 7 , then R : V — > V, and the restriction of R to V is a rotation 
in IR 2 . 


2 Recall that the transpose of a linear operator A : lU — - M. d is the linear operator 
B : R d — > R d which satisfies A(x) ■ y = x ■ B(y) for all 1,56 R d . We write B = A*. The 
inverse of A (when it exists) is the linear operator C : M. d — > R r/ with A o C = C o A = I 

(where I is the identity), and we write C = A -1 . 
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Geometrically, the vector 7 gives the direction of the axis of rotation. A 
proof of this fact is given in Exercise 2. Finally, if R is improper, then 
—R is proper (since in R 3 det(— R) = — det (R)), so R is the composition 
of a proper rotation and a symmetry with respect to the origin. 

Example 4. Given two orthonormal bases {ei, . . . , e^} and {e ' x , . . . , e' d } 
in R d , we can define a rotation R by letting i?(e») = e\ for i — 1, . . . , d. 
Conversely, if R is a rotation and {ei, . . . , ed} is an orthonormal basis, 
then {e^, . . . , e' d }, where e' = R(ej), is another orthonormal basis. 

1.2 Integration on 

Since we shall be dealing with functions on M d , we will need to discuss 
some aspects of integration of such functions. A more detailed review of 
integration on R. d is given in the appendix. 

A continuous complex-valued function / on R d is said to be rapidly 
decreasing if for every multi-index a the function \x a f(x)\ is bounded. 
Equivalently, a continuous function is of rapid decrease if 


sup \x\ k \f(x)\ < 00 for every k = 0, 1, 2 , . . .. 
xeRy 


Given a function of rapid decrease, we define 



where Qn denotes the closed cube centered at the origin, with sides of 
length N parallel to the coordinate axis, that is, 


Qn = {x € R d : \xi\ < N/2 for i — 1 , . . . , d}. 


The integral over Qn is a multiple integral in the usual sense of Riemann 
integration. That the limit exists follows from the fact that the integrals 
In — Jq n f(x) dx form a Cauchy sequence as N tends to infinity. 

Two observations are in order. First, we may replace the square Qn 
by the ball Bn — {x € R d : \x\ < N} without changing the definition. 
Second, we do not need the full force of rapid decrease to show that the 
limit exists. In fact it suffices to assume that / is continuous and 


sup |.T| d+e \f(x)\ < 00 for some e > 0. 
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For example, functions of moderate decrease on R correspond to e = 1. 
In keeping with this we define functions of moderate decrease on R d 
as those that are continuous and satisfy the above inequality with e = 1 . 


The interaction of integration with the three important groups of sym- 
metries is as follows: if f is of moderate decrease, then 

(i) f f(x + h) dx — f f(x) dx for all h G R d , 

Jm d J R d 

(ii) 5 d I f(6x) dx — f f(x) dx for all 5 > 0, 

J R d J R d 

(iii) / f(R(x)) dx= f(x) dx for every rotation R. 

J K d J R d 


Polar coordinates 

It will be convenient to introduce polar coordinates in R d and find the 
corresponding integration formula. We begin with two examples which 
correspond to the case d = 2 and d = 3. (A more elaborate discussion 
applying to all d is contained in the appendix.) 

Example 1. In R 2 , polar coordinates are given by (r, 6 ) with r > 0 and 
0 < 6 < 27T. The Jacobian of the change of variables is equal to r, so that 

p p2,7V n OG 

/ f(x)dx= / / /(rcos 6, r sin 9) r dr dd. 

J R 2 Jo Jo 

Now we may write a point on the unit circle S 1 as 7 = (cos 9, sin 9), and 
given a function g on the circle, we define its integral over S 1 by 


/‘‘I'K 


is 1 


< 7 ( 7 ) ^( 7 ( 7 ) = / < 7 (cos 9, sin 9) d9. 


With this notation we then have 



/(r'y) r dr da( 7 ). 


Example 2. In R 3 one uses spherical coordinates given by 

= r sin 9 cos ip, 

= r sin 9 sin tp, 

= rcos 9, 


Xl 

X2 

X 3 
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where 0 < r, 0 < # < 7r and 0 < tp < 2ir. The Jacobian of the change of 
variables is r 2 sin 8 so that 

/ f(x) dx = 

J R 3 

p2tv pTC rOO 

/ / / /(r sin 8 cos p, rsin 0sin p, r cos 0)r 2 dr sin 8 dO dip. 

Jo Jo Jo 

If g is a function on the unit sphere S 2 = {x € K 3 : |.t| — 1}, and 7 = 
(sin 8 cos <p, sin 8 sin ip, cos 8), we define the surface element dc >-(7) by 


('2'K /*7T 


Is 2 


g( 7 ) da( 7 ) = 


g( 7) sin 8 dd dtp. 


10 jo 


As a result. 


[ f(x)dx— [ [ f(r^)r 2 dr dcr (7). 

J r 3 Js 2 J 0 


In general, it is possible to write any point in M. d — {0} uniquely as 


x = r 7 


where 7 lies on the unit sphere S d ~ 1 C IR d and r > 0. Indeed, take r — \x\ 
and 7 = x/\x\. Thus one may proceed as in the cases d = 2 or d = 3 to 
define spherical coordinates. The formula we shall use is 




dr da{ 7), 


whenever / is of moderate decrease. Here dcr (7) denotes the surface 
element on the sphere S d ~ 1 obtained from the spherical coordinates. 


2 Elementary theory of the Fourier transform 

The Schwartz space <S(R d ) (sometimes abbreviated as S) consists of 
all indefinitely differentiable functions / on such that 


sup 

zeR 11 



< 00, 


for every multi-index a and /3. In other words, f and all its derivatives 
are required to be rapidly decreasing. 
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Example 1. An example of a function in S(R d ) is the d-dimensional 
Gaussian given by . The theory in Chapter 5 already made clear 

the central role played by this function in the case d — 1 . 

The Fourier transform of a Schwartz function / is defined by 

/(£) = f f(x)e~ 2nix ^dx, for £ € R d . 

Js. d 

Note the resemblance with the formula in one-dimension, except that we 
are now integrating on R rf , and the product of x and £ is replaced by the 
inner product of the two vectors. 

We now list some simple properties of the Fourier transform. In the 
next proposition the arrow indicates that we have taken the Fourier trans- 
form, so F(x) — > G(£) means that G(£) = F(£). 

Proposition 2.1 Let f £ <S(M d ). 

(i) f(x + h) — » /(£)e 27r ^' /l whenever h £ R d . 

(ii) f(x)e~ 2nixh — > /(£ + h) whenever h £ R d . 

(iii) f(5x) — > 5~ d /(<5 _1 £) whenever 5 > 0. 

( iv ) fix) — > (2t ri£)“/(0- 

(v) (— 2 tt ix) a f(x) — + /(£). 

(vi) f{Rx) — > /(i?£) whenever R is a rotation. 

The first five properties are proved in the same way as in the one- 
dimensional case. To verify the last property, simply change variables 
y = Rx in the integral. Then, recall that |det(f?)| = l, and 
R~ 1 y ■ £ = y ■ i?£, because R is a rotation. 

Properties (iv) and (v) in the proposition show that, up to factors of 
27 tz, the Fourier transform interchanges differentiation and multiplication 
by monomials. This motivates the definition of the Schwartz space and 
leads to the next corollary. 

Corollary 2.2 The Fourier transform maps S(R d ) to itself. 

At this point we disgress to observe a simple fact concerning the in- 
terplay between the Fourier transform and rotations. We say that a 
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function / is radial if it depends only on |x|; in other words, / is radial 
if there is a function / 0 (u), defined for u> 0, such that f\x) = /o(|x|). 
We note that / is radial if and only if f(Rx) = f(x) for every rotation 
R. In one direction, this is obvious since \Rx\ = \x\. Conversely, suppose 
that f{Rx) — f(x), for all rotations R. Now define /o by 



Note that /o is well defined, since if x and x' are points with |x| = \x'\ 
there is always a rotation R so that x' = Rx. 

Corollary 2.3 The Fourier transform of a radial function is radial. 

This follows at once from property (vi) in the last proposition. Indeed, 
the condition f(Rx) — f(x) for all R implies that f(Rf) = /(£) for all 
R 1 thus / is radial whenever / is. 

An example of a radial function in is the Gaussian e -7r l x l~. Also, 
we observe that when d — 1 , the radial functions are precisely the even 
functions, that is, those for which f{x) — f(—x). 

After these preliminaries, we retrace the steps taken in the previous 
chapter to obtain the Fourier inversion formula and Plancherel theorem 
for IR d . 

Theorem 2.4 Suppose f E S(M. d ). Then 



Moreover 



The proof proceeds in the following stages. 

Step 1. The Fourier transform of e _7r l x l~ is e _7r ^" . To prove this, 


notice that the properties of the exponential functions imply that 

g— "'M 2 — e ~ nx i . .. e ~ wx d an d g— 27rix-f _ e -2wix 1 -^ 1 , , . e -2mx d -£, d 

so that the integrand in the Fourier transform is a product of d functions, 
each depending on the variable Xj (1 < j < d ) only. Thus the assertion 
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follows by writing the integral over W* as a series of repeated integrals, 
each taken over R. For example, when d = 2, 




dx= f e - nx h- 2nix ^ 2 ( [ e~™*e~ 2nix ^dx j) dx 2 
Jr \J r J 

= f e - nx ' e - 2niX2 ^e- n ^ dx 2 


= e-^e-^ 


As a consequence of Proposition 2.1, applied with S 1 / 2 instead of <5, we 
find that (e -7r5 N 2 ) = <5 _d / 2 e _7r ^ 2//<5 . 

Step 2. The family K$(x) — §~ d / 2 e~' K ^ 2 is a family of good kernels 
in By this we mean that 

(i) [ K s (x)dx= 1, 

Jm d 

(ii) f \Kg(x) \ dx < M (in fact K$(x) > 0), 

J R d 

(iii) For every 77 > 0, / \Ks(x) \ dx — > 0 as <5 — > 0. 

J |x| >77 

The proofs of these assertions are almost identical to the case d — 1. As 
a result 

f K$(x)F{x) dx — > F(0) as 5 — > 0 

J R d 


when F is a Schwartz function, or more generally when F is bounded 
and continuous at the origin. 

Step 3. The multiplication formula 



f(y)a(y ) dy 


holds whenever / and g are in S. The proof requires the evaluation of the 
integral of f(x)g(y)e~ 2mx ' v over (x,y) G M. 2d = x M. d as a repeated 
integral, with each separate integration taken over R d . The justification 
is similar to that in the proof of Proposition 1.8 in the previous chapter. 
(See the appendix.) 

The Fourier inversion is then a simple consequence of the multiplica- 
tion formula and the family of good kernels as in Chapter 5. It also 
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follows that the Fourier transform T is a bijective map of S(R d ) to itself, 
whose inverse is 


^*(s)(®)=/ 3(0 

J R d 


e 2nix '£ d£. 


Step 4. Next we turn to the convolution, defined by 



We have that f*g£S(R d ), f*g = g*f, and (/* 5 ) (£) = /(£)<?(£>• 


The argument is similar to that in one-dimension. The calculation of the 


Fourier transform of f * g involves an integration of f(y)g(x — y)e 2mx '£ 


(over R 2d = R d x R d ) expressed as a repeated integral. 

Then, following the same argument in the previous chapter, we obtain 
the d-dimensional Plancherel formula, thereby concluding the proof of 
Theorem 2.4. 

3 The wave equation in R d xK 

Our next goal is to apply what we have learned about the Fourier trans- 
form to the study of the wave equation. Here, we once again simplify 
matters by restricting ourselves to functions in the Schwartz class S. We 
note that in any further analysis of the wave equation it is important to 
allow functions that have much more general behavior, and in particular 
that may be discontinuous. However, what we lose in generality by only 
considering Schwartz functions, we gain in transparency. Our study in 
this restricted context will allow us to explain certain basic ideas in their 
simplest form. 

3.1 Solution in terms of Fourier transforms 

The motion of a vibrating string satisfies the equation 


d 2 u 1 d 2 u 
dx 2 c 2 8 1 2 


which we referred to as the one-dimensional wave equation. 

A natural generalization of this equation to d space variables is 


(1) 


d 2 u d 2 u 1 d 2 u 

dx\ + dx 2 d c 2 dt 2 ' 


In fact, it is known that in the case d = 3, this equation determines the 
behavior of electromagnetic waves in vacuum (with c = speed of light) . 
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Also, this equation describes the propagation of sound waves. Thus (1) 
is called the d-dimensional wave equation. 

Our first observation is that we may assume c = 1 , since we can rescale 
the variable t if necessary. Also, if we define the Laplacian in d dimen- 
sions by 

A _ d 2 d 2 

~dxl + '" + d^ d ’ 


then the wave equation can be rewritten as 


( 2 ) 


A u — 


d 2 u 

W 


The goal of this section is to find a solution to this equation, subject 
to the initial conditions 


u(x,0) = f(x) and 


du 

~dt 


(x,0) = g(x ), 


where f,g G <S( IR d ). This is called the Cauchy problem for the wave 
equation. 

Before solving this problem, we note that while we think of the variable 
t as time, we do not restrict ourselves to t > 0. As we will see, the solution 
we obtain makes sense for all t £ M. This is a manifestation of the fact 
that the wave equation can be reversed in time (unlike the heat equation). 

A formula for the solution of our problem is given in the next theorem. 
The heuristic argument which leads to this formula is important since, as 
we have already seen, it applies to some other boundary value problems 
as well. 

Suppose u solves the Cauchy problem for the wave equation. The 
technique employed consists of taking the Fourier transform of the equa- 
tion and of the initial conditions, with respect to the space variables 
x\ , ,Xd- This reduces the problem to an ordinary differential equation 
in the time variable. Indeed, recalling that differentiation with respect to 
Xj becomes multiplication by and the differentiation with respect 

to t commutes with the Fourier transform in the space variables, we find 
that (2) becomes 

-4tt 2 |£| 2 fi(C,t) - 

For each fixed £ G M. d , this is an ordinary differential equation in t whose 
solution is given by 

u{£, t ) = A(f) cos(27ij£|f) + B(£) sin(27r|£|t), 
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where for each f, A(£) and B(f) are unknown constants to be determined 
by the initial conditions. In fact, taking the Fourier transform (in x) of 
the initial conditions yields 

u(t,0 ) = /(£) and — (£,0) = <?(£)• 

We may now solve for A(f) and B(f) to obtain 

MO = f(0 and 2 tt|^| B(f) = g(f). 


Therefore, we find that 

HOt) = /(Ocos(27r|e|t) + g(0 Sm ^| 1 * ) , 

and the solution u is given by taking the inverse Fourier transform in 
the £ variables. This formal derivation then leads to a precise existence 
theorem for our problem. 


Theorem 3.1 A solution of the Cauchy problem for the wave equation 
is 


(3) u(x,t ) 


f(0 cos(2tt|£| t)+g(f) 


sin(27r|£|t)" 

2vr|C| . 


e 2nix< df. 


Proof We first verify that u solves the wave equation. This is 
straightforward once we note that we can differentiate in x and t un- 
der the integral sign (because / and g are both Schwartz functions) and 
therefore u is at least C 2 . On the one hand we differentiate the expo- 
nential with respect to the x variables to get 


A u(x, t ) 


/(£)cos(27r|£|f) +g(0 


sin(27r|£|t)' 
2tt|£| . 


(-4n 2 \f\ 2 )e 2 ™-t df, 


while on the other hand we differentiate the terms in brackets with re- 
spect to t twice to get 


d 2 u 

W 


(x, t ) 


— 4tt 2 |£| 2 /(£) cos(27r|£|t) - 4 tt 2 |£| 2 <7(£) 


sin(27r|g|f) 

2vr|^| 


d 27TZX-^ 


df. 


This shows that u solves equation (2). Setting t = 0we get 


u(x,0)= f f(f)e 2 ™<df = f(x) 

J R d 
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by the Fourier inversion theorem. Finally, differentiating once with re- 
spect to t, setting t — 0, and using the Fourier inversion shows that 

^(*,0) = g{x). 

Thus u also verifies the initial conditions, and the proof of the theorem 
is complete. 

As the reader will note, both /(£) cos(27r|£|f) and g{£) are 
functions in S, assuming as we do that / and g are in S. This is be- 
cause both cos u and (sin it) /it are even functions that are indefinitely 
differentiable. 

Flaving proved the existence of a solution to the Cauchy problem for the 
wave equation, we raise the question of uniqueness. Are there solutions 
to the problem 


a <9 2 u 1 ■ 

An — —tv subject to 
at 2 


u(x, 0) = f(x) and 


du 

— {x,0)=g{x), 


other than the one given by the formula in the theorem? In fact the 
answer is, as expected, no. The proof of this fact, which will not be 
given here (but see Problem 3) , can be based on a conservation of energy 
argument. This is a local counterpart of a global conservation of energy 
statement which we will now present. 

We observed in Exercise 10, Chapter 3, that in the one-dimensional 
case, the total energy of the vibrating string is conserved in time. The 
analogue of this fact holds in higher dimensions as well. Define the 
energy of a solution by 



+ b 


du 

dx d 


2 


dx. 


Theorem 3.2 If u is the solution of the wave equation given by for- 
mula (3), then Eft) is conserved, that is, 


Eft ) = E( 0), for all t E K. 


The proof requires the following lemma. 

Lemma 3.3 Suppose a and b are complex numbers and a is real. Then 

iOi 1O11O11O 

|acos a + frsin a| + | — asina + 6cosa| = |a| + \b\ . 
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This follows directly because e\ = (cos a, sin a) and e 2 — (— sin ct, cos a) 
are a pair of orthonormal vectors, hence with Z = (a, b) £ C 2 , we have 

\Z\ 2 = \Z - ei \ 2 + \Z -e 2 \ 2 , 


where ■ represents the inner product in C 2 . 
Now by Plancherel’s theorem, 


[ 

du 

2 

dx = / 

Js. d 

dt 

J R d 


— 2tt|^| /(£) sin(27r|^|t) + g(£) cos(27r|£|f) d£. 


Similarly, 
/» d 


du 


d Xj 


dx — 


2tt|£|/(£) cos(27r|£|f) + <?(£) sin(2vr|^|t) 


dt 


We now apply the lemma with 

a — 2tt|£|/(£)> b = g(£) and a = 2n\^\t. 


The result is that 



Jm. d 


which is clearly independent of t. Thus Theorem 3.2 is proved. 

The drawback with formula (3), which does give the solution of the 
wave equation, is that it is quite indirect, involving the calculation of the 
Fourier transforms of f and g, and then a further inverse Fourier trans- 
form. However, for every dimension d there is a more explicit formula. 
This formula is very simple when d = 1 and a little less so when d = 3. 
More generally, the formula is “elementary” whenever d is odd, and more 
complicated when d is even (see Problems 4 and 5). 

In what follows we consider the cases d — 1, d = 3, and d = 2, which 
together give a picture of the general situation. Recall that in Chapter 1, 
when discussing the wave equation over the interval [0, L], we found that 
the solution is given by d’Alembert’s formula 


f{x + t) + f(x 


t) 1 f x+t , , , 
— + o 9(y)dy. 
z Jx-t 


(4) 


u(x, t) 


2 
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with the interpretation that both / and g are extended outside [0, L\ by 
making them odd in [—L,L\, and periodic on the real line, with period 
2 L. The same formula (4) holds for the solution of the wave equation 
when d = 1 and when the initial data are functions in 5(M). In fact, this 
follows directly from (3) if we note that 

cos(27r|£|t) - + e“ 2 ™ W) 

and 

Sill ( 27r l^) = 1 _ -2 «|£| U 

27T|e| 4 tu |£| 1 

Finally, we note that the two terms that appear in d’Alembert’s for- 
mula (4) consist of appropriate averages. Indeed, the first term is pre- 
cisely the average of / over the two points that are the boundary of the 
interval [x — t, x + t]; the second term is, up to a factor of t, the mean 
value of g over this interval, that is, (l/2t) f^-t g(y) dy. This suggests a 
generalization to higher dimensions, where we might expect to write the 
solution of our problem as averages of the initial data. This is in fact the 
case, and we now treat in detail the particular situation d — 3. 

3.2 The wave equation in l 3 xl 

If S 2 denotes the unit sphere in K 3 , we define the spherical mean of 
the function / over the sphere of radius t centered at x by 

(5) M t (f)(x) = ^ J f(x - tj) dc r( 7 ), 

where da(~/) is the element of surface area for S 2 . Since 47 T is the area 
of the unit sphere, we can interpret M t (f) as the average value of / over 
the sphere centered at x of radius t. 

Lemma 3.4 Iff 6 <S(R 3 ) andt is fixed, then M t (f) G <S(R 3 ). Moreover, 
Mt(f) is indefinitely differentiable in t, and each t-derivative also belongs 
to S(R 3 ). 

Proof. Let F{x ) = M t (f)(x). To show that F is rapidly decreasing, 
start with the inequality \f(x)\ <Ajv/(l + |ar| JV ) which holds for every 
fixed N > 0. As a simple consequence, whenever t is fixed, we have 

| f(x — qf)| < A' n /( 1 + \x\ N ) for all 7 G S 2 . 
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To see this consider separately the cases when \x\ < 2|t|, and \x\ > 2|t|. 
Therefore, by integration 

|F(x)| < A ' n / (1 + W"), 

and since this holds for every N, the function F is rapidly decreasing. 
One next observes that F is indefinitely differentiable, and 

(6) (j^Y F(x) = M t {f&)(x) 

where f^ a \x) — (d/dx) a f. It suffices to prove this when ( d/dx) a — 
d/dxk, and then proceed by induction to get the general case. Further- 
more, it is enough to take k — 1. Now 


F{x i + h, x 2 ,x 3 ) - Fjx i, x 2 , x 3 ) _ 

h Air 


9h.il) d&il) 


where 

f(x + ei h - jt) - f(x - 7 1 ) 

9h (7) = /i , 

and ei = (1,0,0). Now, it suffices to observe that g h — > fix — jt) 
as h — ► 0 uniformly in 7. As a result, we find that (6) holds, and by 
the first argument, it follows that F(x) is also rapidly decreasing, 

hence F € S. The same argument applies to each f-derivative of 


The basic fact about integration on spheres that we shall need is the 
following Fourier transform formula. 

Lemma 3.5 - / e“ 2 ^' 7 d<r( 7 ) = Sin ( 2 ^l. 

4tt y S 2 v 2tt|£| 

This formula, as we shall see in the following section, is connected to 
the fact that the Fourier transform of a radial function is radial. 

Proof. Note that the integral on the left is radial in £. Indeed, if R is 
a rotation then 


[ e -2«R(SH d CT ( 7 ) = f g-2 *it-R 1 (7) d(7 ( 7 ) = f g-27r«. 7 d<T ( 7 ) 

Js 2 Js 2 Js 2 


because we may change variables 7 — > R 1 (j). (For this, see formula (4) 
in the appendix.) So if |£| = p, it suffices to prove the lemma with 
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£ = (0, 0,p). If p — 0, the lemma is obvious. If p > 0, we choose spherical 
coordinates to find that the left-hand side is equal to 

1 f 271 

47T 


0 —2i rip cos 0 


sin# d6 dip. 


i o Jo 


The change of variables u = — cos 6 gives 


— [ 2 * T e~ 27Tipcose sin Odd dp = - [ n e~ 2 ' Kip cos e sin 9 dd 

47r Jo Jo 2 J 0 


1 
2 

1 r 

2 /_! 


e 2nipu du 


^ \p2-n-ipul 1 

47Tip L 
sin(27 rp) 

2irp ’ 


and the formula is proved. 


By the defining formula (5) we may interpret Mt(f) as a convolution 
of the function f with the element da, and since the Fourier transform 
interchanges convolutions with products, we are led to believe that M t (f) 
is the product of the corresponding Fourier transforms. Indeed, we have 
the identity 


(7) 

To see this, write 

W)(0 - [ e 
Jr 




-2'kix-^ 


47 r . 


sin( 27 r|£|t) 

27 r|£|t 

^ f(x ~ it) da(j) ) dx, 


and note that we may interchange the order of integration and make a 
simple change of variables to achieve the desired identity. 

As a result, we find that the solution of our problem may be expressed 
by using the spherical means of the initial data. 

Theorem 3.6 The solution when d = 3 of the Cauchy problem for the 
wave equation 


A u — 


d 2 u 


dt 2 
is given by 


du 

subject to u{x, 0) = f{x) and —(x,0)=g(x) 


u i x ,t) = g^(tM t {f){x)) +tM t (g)(x). 
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Proof. Consider first the problem 

d^u du 

A u= ~of 2 subject to it(x, 0) = 0 and — (x, 0) = g(x). 

Then by Theorem 3.1, we know that its solution u\ is given by 


u 1 


(x,t) = f 

J i= 


m 


sin(27ij£|i) 


0 27rzrr-£ 


= t 


9(f) 


2tt|£| 
sin(27r|£|t) 
27r|^|t 


df 


^TTIX-^ 


df 


= tM t (g)(x ), 


where we have used (7) applied to g , and the Fourier inversion formula. 
According to Theorem 3.1 again, the solution to the problem 

d^u du 

A u = subject to u( x, 0) = f(x) and — (aj,0) = 0 

is given by 


u 2 (x,t) = / /(£)cos(27r|£|f) 


0 2ir ix-£ 


df 


m 


= ^~ t (tM t {f)(x)). 


sin(27r|^|t) 

27r|^|t 


0 2l fiX-% 


df 


We may now superpose these two solutions to obtain u = u\ + u 2 as the 
solution of our original problem. 


Huygens principle 

The solutions to the wave equation in one and three dimensions are given, 
respectively, by 


u(x, t ) 


f(x + t) + /( x 
2 


z Jx-t 


u{x,t) = tM t (f)(x )) + tM t (g){x). 


and 
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We observe that in the one-dimensional problem, the value of the solution 
at (x,t) depends only on the values of / and g in the interval centered 
at x of length 2 1, as shown in Figure 1. 

If in addition g — 0, then the solution depends only on the data at the 
two boundary points of this interval. In three dimensions, this boundary 
dependence always holds. More precisely, the solution u{x , t) depends 
only on the values of / and g in an immediate neighborhood of the sphere 
centered at x and of radius t. This situation is depicted in Figure 2, where 
we have drawn the cone originating at (x, t ) and with its base the ball 
centered at x of radius t. This cone is called the backward light cone 
originating at (x. t). 


(x, t) 



Figure 2. Backward light cone originating at (x,t) 


Alternatively, the data at a point x$ in the plane t = 0 influences the 
solution only on the boundary of a cone originating at xq, called the 
forward light cone and depicted in Figure 3. 

This phenomenon, known as the Huygens principle, is immediate 
from the formulas for u given above. 

Another important aspect of the wave equation connected with these 
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Figure 3. The forward light cone originating at x 0 


considerations is that of the finite speed of propagation. (In the 
case where c — 1 , the speed is 1 . ) This means that if we have an initial 
disturbance localized at x — xq, then after a finite time t, its effects will 
have propagated only inside the ball centered at xo of radius \t\. To state 
this precisely, suppose the initial conditions / and g are supported in the 
ball of radius <5, centered at xq (think of S as small). Then u(x,t) is 
supported in the ball of radius |f| + S centered at xq. This assertion is 
clear from the above discussion. 

3.3 The wave equation in R 2 x M: descent 

It is a remarkable fact that the solution of the wave equation in three 
dimensions leads to a solution of the wave equation in two dimensions. 
Define the corresponding means by 

M t (F)(x) = [ F(x - ty)( 1 - M 2 r 1/2 dy. 

271 J\v\< i 

Theorem 3.7 A solution of the Cauchy problem for the wave equation 
in two dimensions with initial data f,g 6 <S(R 2 ) is given by 

(8) u(x,t) = J^( tM t (f)(x )) + tM t (g)(x). 

Notice the difference between this case and the case d = 3. Here, u at 
(x,t) depends on / and g in the whole disc (of radius \t\ centered at x), 
and not just on the values of the initial data near the boundary of that 
disc. 

Formally, the identity in the theorem arises as follows. If we start 
with an initial pair of functions / and g in 5(R 2 ), we may consider the 
corresponding functions / and g on IR 3 that are merely extensions of / 
and g that are constant in the X 3 variable, that is, 

f(xi,x 2 ,x 3 ) = f{xi,x 2 ) and 5 (^ 1 , x 2 , x 3 ) = g(x 1 , x 2 ). 
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Now, if u is the solution (given in the previous section) of the 3-dimensional 
wave equation with initial data / and g, then one can expect that u is 
also constant in £ 3 so that u satisfies the 2 -dimensional wave equation. 
A difficulty with this argument is that / and g are not rapidly decreasing 
since they are constant in £ 3 , so that our previous methods do not apply. 
However, it is easy to modify the argument so as to obtain a proof of 
Theorem 3.7. 

We fix T > 0 and consider a function 77 (£3) that is in <S(K), such that 
77(0:3) = 1 if |x 3 | < 3 T. The trick is to truncate / and g in the £3- variable, 
and consider instead 

f b {x 1 ,x 2 ,x 3 ) = f(xi,x 2 )r](x 3 ) and g" (x 1 ,x 2 ,x 3 ) = g(x 1 ,x 2 )g(x 3 ). 

Now both /' b and g° are in <S(R 3 ), so Theorem 3.6 provides a solution 
vJ of the wave equation with initial data f b and g' > . It is easy to see 
from the formula that vJ(x,t) is independent of £ 3 , whenever |£ 3 | < T 
and |t| < T. In particular, if we define u{x\,x 2 ,i) = ift{x\, x 2 , 0, t), then 
u satisfies the 2-dimensional wave equation when \t\ <T. Since T is 
arbitrary, u is a solution to our problem, and it remains to see why u has 
the desired form. 

By definition of the spherical coordinates, we recall that the integral 
of a function H over the sphere S 2 is given by 



I / H (sin 9 cosip, sin 6 simp, cos 0) sin 6 dd dip. 

0 Jo 


If H does not depend on the last variable, that is, H(x i,x 2 ,x 3 ) = h(x \,x 2 ) 
for some function h of two variables, then 


M t (H)(x 1, £2, 0 ) = 



To calculate this last integral, we split the ^-integral from 0 to 7 r /2 and 
then 7t/2 to 7 r. By making the change of variables r = sin 6, we find, after 
a final change to polar coordinates, that 



M t (h)(x i,£ 2 ). 
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Applying this to H = f b , h = /, and H = g b , h = g, we find that u is 
given by the formula (8), and the proof of Theorem 3.7 is complete. 

Remark. In the case of general d, the solution of the wave equation 
shares many of the properties we have discussed in the special cases 
d— 1,2, and 3. 

• At a given time t, the initial data at a point x only affects the solu- 
tion u in a specific region. When d > 1 is odd, the data influences 
only the points on the boundary of the forward light cone origi- 
nating at x, while when d — 1 or d is even, it affects all points of 
the forward light cone. Alternatively, the solution at a point (x,t) 
depends only on the data at the base of the backward light cone 
originating at (x, t). In fact, when d > 1 is odd, only the data in an 
immediate neighborhood of the boundary of the base will influence 
u(x, t). 

• Waves propagate with finite speed: if the initial data is supported 
in a bounded set, then the support of the solution u spreads with 
velocity 1 (or more generally c, if the wave equation is not normal- 
ized). 

We can illustrate some of these facts by the following observation about 
the different behavior of the propagation of waves in three and two dimen- 
sions. Since the propagation of light is governed by the three-dimensional 
wave equation, if at t = 0 a light flashes at the origin, the following hap- 
pens: any observer will see the flash (after a finite amount of time) only 
for an instant. In contrast, consider what happens in two dimensions. If 
we drop a stone in a lake, any point on the surface will begin (after some 
time) to undulate; although the amplitude of the oscillations will decrease 
over time, the undulations will continue (in principle) indefinitely. 

The difference in character of the formulas for the solutions of the 
wave equation when d = 1 and d — 3 on the one hand, and d — 2 on 
the other hand, illustrates a general principle in d-dimensional Fourier 
analysis: a significant number of formulas that arise are simpler in the 
case of odd dimensions, compared to the corresponding situations in even 
dimensions. We will see several further examples of this below. 

4 Radial symmetry and Bessel functions 

We observed earlier that the Fourier transform of a radial function in 
R d is also radial. In other words, if f(x) — fo(\x\) for some f 0 , then 
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/(£) = .F 0 (|£|) for some F 0 . A natural problem is to determine a relation 


between f 0 and F 0 . 

This problem has a simple answer in dimensions one and three. If 
d = 1 the relation we seek is 



If we recall that R has only two rotations, the identity and multiplication 
by —1, we find that a function is radial precisely when it is even. Having 
made this observation it is easy to see that if / is radial, and |£| = p, 
then 



Jo 

In the case d = 3, the relation between /o and Fq is also quite simple 
and given by the formula 



( 10 ) 


The proof of this identity is based on the formula for the Fourier trans- 
form of the surface element da given in Lemma 3.5: 



More generally, the relation between / 0 and To has a nice description 
in terms of a family of special functions that arise naturally in problems 
that exhibit radial symmetry. 

The Bessel function of order n € Z, denoted J n (p), is defined as the 
n th Fourier coefficient of the function e* psin£l . So 
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therefore 

OO 

e ip sin 8 = J2 J n (p)e ind . 

n= — OO 


As a result of this definition, we find that when d = 2, the relation be- 
tween /o and Fq is 

poo 

(11) F 0 (p) — 2n Jo(2irrp)f 0 (r)r dr. 

Jo 

Indeed, since /(£) is radial we take £ = (0, —p) so that 


/(£)= [ f(x)e 2nix <°’ri dx 

Jr 2 

p2ir poo 

= f 0 (r)e 2nirpsine r dr d6 

Jo Jo 

poo 

= 27 T Jo(2irrp)fo(r )r dr, 

Jo 


as desired. 

In general, there are corresponding formulas relating /q and Fq in M. d 
in terms of Bessel functions of order d/2 — 1 (see Problem 2). In even 
dimensions, these are the Bessel functions we have defined above. For 
odd dimensions, we need a more general definition of Bessel functions to 
encompass half-integral orders. Note that the formulas for the Fourier 
transform of radial functions give another illustration of the differences 
between odd and even dimensions. When d — 1 or d = 3 (as well as 
d > 3, d odd) the formulas are in terms of elementary functions, but this 
is not the case when d is even. 

5 The Radon transform and some of its applications 

Invented by Johann Radon in 1917, the integral transform we discuss 
next has many applications in mathematics and other sciences, includ- 
ing a significant achievement in medicine. To motivate the definitions 
and the central problem of reconstruction, we first present the close con- 
nection between the Radon transform and the development of X-ray 
scans (or CAT scans) in the theory of medical imaging. The solution of 
the reconstruction problem, and the introduction of new algorithms and 
faster computers, all contributed to a rapid development of computerized 
tomography. In practice, X-ray scans provide a “picture” of an internal 
organ, one that helps to detect and locate many types of abnormalities. 
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After a brief description of X-ray scans in two dimensions, we define 
the X-ray transform and formulate the basic problem of inverting this 
mapping. Although this problem has an explicit solution in IR 2 , it is 
more complicated than the analogous problem in three dimensions, hence 
we give a complete solution of the reconstruction problem only in IR 3 . 
Here we have another example where results are simpler in the odd- 
dimensional case than in the even-dimensional situation. 

5.1 The X-ray transform in IR 2 

Consider a two-dimensional object O lying in the plane IR 2 , which we 
may think of as a planar cross section of a human organ. 

First, we assume that O is homogeneous, and suppose that a very 
narrow beam of X-ray photons traverses this object. 



Figure 4. Attenuation of an X-ray beam 


If la and / denote the intensity of the beam before and after passing 
through O, respectively, the following relation holds: 


/ = I 0 e~ dp . 


Here d is the distance traveled by the beam in the object, and p denotes 
the attenuation coefficient (or absorption coefficient), which depends on 
the density and other physical characteristics of O. If the object is not 
homogeneous, but consists of two materials with attenuation coefficients 
Pi and P2, then the observed decrease in the intensity of the beam is 
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given by 

/ = I 0 e~ dlpl ~ d2p2 

where d\ and denote the distances traveled by the beam in each ma- 
terial. In the case of an arbitrary object whose density and physical 
characteristics vary from point to point, the attenuation factor is a func- 
tion p in IR 2 , and the above relations become 

I = I 0 e^ p . 

Here L is the line in IR 2 traced by the beam, and f L p denotes the line 
integral of p over L. Since we observe / and Iq, the data we gather 
after sending the X-ray beam through the object along the line L is the 
quantity 



Since we may initially send the beam in any given direction, we may 
calculate the above integral for every line in IR 2 . We define the X-ray 
transform (or Radon transform in IR 2 ) of p by 

X(p)(L) = j L p. 

Note that this transform assigns to each appropriate function p on IR 2 
(for example, p € tS (IR 2 )) another function X(p) whose domain is the set 
of lines L in IR 2 . 

The unknown is p , and since our original interest lies precisely in the 
composition of the object, the problem now becomes to reconstruct the 
function p from the collected data, that is, its X-ray transform. We 
therefore pose the following reconstruction problem: Find a formula for 
p in terms of X(p). 

Mathematically, the problem asks for a formula giving the inverse of 
X. Does such an inverse even exist? As a first step, we pose the following 
simpler uniqueness question: If X{p) = X(p'), can we conclude that p — 

There is a reasonable a priori expectation that X(p) actually deter- 
mines p, as one can see by counting the dimensionality (or degrees of 
freedom) involved. A function p on IR 2 depends on two parameters (the 
x\ and X 2 coordinates, for example). Similarly, the function X(p), which 
is a function of lines L , is also determined by two parameters (for ex- 
ample, the slope of L and its ^-intercept) . In this sense, p and X(p) 
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convey an equivalent amount of information, so it is not unreasonable to 
suppose that X(p) determines p. 

While there is a satisfactory answer to the reconstruction problem, 
and a positive answer to the uniqueness question in M. 2 , we shall forego 
giving them here. (However, see Exercise 13 and Problem 8.) Instead 
we shall deal with the analogous but simpler situation in R 3 . 

Let us finally remark that in fact, one can sample the X-ray trans- 
form, and determine X(p)(L) for only finitely many lines. Therefore, 
the reconstruction method implemented in practice is based not only on 
the general theory, but also on sampling procedures, numerical approx- 
imations, and computer algorithms. It turns out that a method used 
in developing effective relevant algorithms is the fast Fourier transform, 
which incidentally we take up in the next chapter. 

5.2 The Radon transform in R 3 

The experiment described in the previous section applies in three dimen- 
sions as well. If O is an object in IR 3 determined by a function p which 
describes the density and physical characteristics of this object, sending 
an X-ray beam through O determines the quantity 



for every line in R 3 . In R 2 this knowledge was enough to uniquely de- 
termine p, but in R 3 we do not need as much data. In fact, by using the 
heuristic argument above of counting the number of degrees of freedom, 
we see that for functions p in R 3 the number is three, while the number 
of parameters determining a line L in R 3 is four (for example, two for 
the intercept in the (xi, £ 2 ) plane, and two more for the direction of the 
line). Thus in this sense, the problem is over-determined. 

We turn instead to the natural mathematical generalization of the two- 
dimensional problem. Here we wish to determine the function in R 3 by 
knowing its integral over all planes 3 in R 3 . To be precise, when we speak 
of a plane, we mean a plane not necessarily passing through the origin. 
If V is such a plane, we define the Radon transform 7 Z(f) by 

mm = [ /• 

Jv 

To simplify our presentation, we shall follow our practice of assuming 
that we are dealing with functions in the class <S(R 3 ). However, many 


3 Note that the dimensionality associated with points on M 3 , and that for planes in R 3 , 
equals three in both cases. 
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of the results obtained below can be shown to be valid for much larger 
classes of functions. 

First, we explain what we mean by the integral of / over a plane. The 
description we use for planes in IR 3 is the following: given a unit vector 
7 £ S 2 and a number t £ IR, we define the plane T \ 7 by 



: x • 7 — t}. 


So we parametrize a plane by a unit vector 7 orthogonal to it, and by its 
“distance” t to the origin (see Figure 5). Note that = V-t _ 7 , and 
we allow t to take negative values. 



Figure 5. Description of a plane in IR 3 


Given a function / £ S(R d ), we need to make sense of its integral over 
V lrr We proceed as follows. Choose unit vectors ei, e 2 so that ei, e 2, 7 is 
an orthonormal basis for IR 3 . Then any x £ Vt ,7 can be written uniquely 
as 

x = tj + u where u = uiei + rt 2 e 2 with m, u 2 € IR. 


If / £ <S(IR 3 ), we define 


( 12 ) 


/ / = / fit 7 + uiei + u 2 e 2 ) du x du 2 

JVt , 7 Jr 2 


To be consistent, we must check that this definition is independent of 
the choice of the vectors ei, e 2 . 
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Proposition 5.1 If f £ <S(R 3 ), then for each 7 the definition of f Vt f 
is independent of the choice of e\ and e 2 . Moreover 



Proof. If e! x ,e! 2 is another choice of basis vectors so that 7 , e \ , e' 2 is 
orthonormal, consider the rotation R in R 2 which takes e\ to e[ and 
e 2 to e' 2 - Changing variables v! = R(u) in the integral proves that our 
definition ( 12 ) is independent of the choice of basis. 

To prove the formula, let R denote the rotation which takes the stan- 
dard basis of unit vectors 4 in R 3 to 7 , ei, and e 2 - Then 


f(x) dx = [ f(Rx) dx 

Js. 3 

= / f( x i7 + x 2 ei + x 3 e 2 ) dx 1 dx 2 dx 3 
- £(/,./) - 


Remark. We digress to point out that the X-ray transform deter- 
mines the Radon transform, since two-dimensional integrals can be ex- 
pressed as iterated one-dimensional integrals. In other words, the knowl- 
edge of the integral of a function over all lines determines the integral of 
that function over any plane. 

Having disposed of these preliminary matters, we turn to the study of 
the original problem. The Radon transform of a function / £ <S(R 3 ) 
is defined by 

^(/)(*. 7 )=/ /• 

Jv t , T 

In particular, we see that the Radon transform is a function on the 
set of planes in R 3 . From the parametrization given for a plane, we 
may equivalently think of 7 Z(f) as a function on the product RxS 2 = 
{ (t, 7 ) : t £ R, 7 £ S 2 }, where S 2 denotes the unit sphere in R 3 . The 
relevant class of functions on R x S 2 consists of those that satisfy the 
Schwartz condition in t uniformly in 7 . In other words, we define 5(Rx 
S 2 ) to be the space of all continuous functions F(t, 7 ) that are indefinitely 


4 Here we are referring to the vectors (1, 0, 0), (0, 1, 0), and (0, 0, 1). 


Ibookroot October 20, 2007 


204 Chapter 6. THE FOURIER TRANSFORM ON : 

differentiable in t, and that satisfy 


sup \t\ k 

tens, ■yes 2 


d e F 


8F (t ' 7) 


< oo for all integers k, l > 0 . 

Our goal is to solve the following problems. 

Uniqueness problem: If 7 Z(f) = 71(g), then f = g. 


Reconstruction problem: Express / in terms of 71(f). 


The solutions will be obtained by using the Fourier transform. In fact, 
the key point is a very elegant and essential relation between the Radon 
and Fourier transforms. 

Lemma 5.2 If f E <S(R 3 ), then 7l(f)(t, 7 ) G <S(R) for each fixed 7 . More- 
over, 

K(f)(s,i) = f(s'y). 

To be precise, / denotes the (three-dimensional) Fourier transform 
of /, while 7l(f)(s,j) denotes the one-dimensional Fourier transform of 
71(f) (t, 7 ) as a function of t, with 7 fixed. 

Proof. Since / € <S(R 3 ), for every positive integer N there is a con- 
stant < 00 so that 

(1 + ^1)^(1 + \u\) N \f(tj + u)\ < A n , 


if we recall that x = t'y + u, where 7 is orthogonal to u. Therefore, as 
soon as IV > 3, we find 


(1 + \t\) N n(f)(t, 7 )<a n [ </?i < 00 . 

J R 2 (1 + |m|) 

A similar argument for the derivatives shows that IZ(f)(t, r y) € <S(M) for 
each hxed 7 . 

To establish the identity, we first note that 


Tl(f)(s, 7 ) - 


/ — 00 \JV , 
poo 


f )e 


— 2tt ist 


dt 


*.7 


f (t^/ + uyey + u 2 e 2 ) duy du 2 e mst dt. 
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However, since 7 ■ u — 0 and (7) = 1, we may write 

g— 27 t ist g — 27ris7-(t7+n) 


As a result, we find that 

= [ [ f{tl + uiei + u 2 e2)e -27nsr(:t7+ “ ) du\ du 2 dt 

J — 00 Jm 2 

= f °° f /(*7 + u ) e - 2 ™*r(*r-H*) dudit. 

J — 00 Jm 2 

A final rotation from 7, ei,e2 to the standard basis in IR 3 proves that 
K(f)(s, 7) = f(s 7), as desired. 

As a consequence of this identity, we can answer the uniqueness ques- 
tion for the Radon transform in IR 3 in the affirmative. 

Corollary 5.3 If /, g £ <S(M 3 ) and 1Z(f) = 'R-(g), then f — g. 

The proof of the corollary follows from an application of the lemma to 
the difference / — g and use of the Fourier inversion theorem. 

Our final task is to give the formula that allows us to recover / from 
its Radon transform. Since 11(f) is a function on the set of planes in 
R 3 , and / is a function of the space variables x € M 3 , to recover / we 
introduce the dual Radon transform, which passes from functions defined 
on planes to functions in R 3 . 

Given a function F on R x S' 2 , we define its dual Radon transform 
by 

(13) n*(F)(x)= j F(x ■ 7,7) da(j). 

Js 2 

Observe that a point x belongs to P t]7 if and only if x ■ 7 = t, so the idea 
here is that given x € IR 3 , we obtain lZ*(F)(x) by integrating F over the 
subset of all planes passing through x, that is, 

1Z*(F)(x) = f F, 

J {Vt ,’ 7 such that xEVt , 7 } 

where the integral on the right is given the precise meaning in (13). 
We use the terminology “dual” because of the following observation. If 
Vf = <S(1R 3 ) with the usual Hermitian inner product 

(f,9) 1 = f f(x)g(x)dx , 

Jr 3 
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and V 2 = <S(K x S 2 ) with the Hermitian inner product 

(F,G) 2 = f I F{t,i)G{t,i)da{i)dt, 

J K JS 2 

then 


1Z:V 1 ^V 2 , n*:V 2 ^V 1 , 


with 


(14) (7 lf,F) 2 = (f,n*F) 1 . 

The validity of this identity is not needed in the argument below, and 
its verification is left as an exercise for the reader. 

We can now state the reconstruction theorem. 

Theorem 5.4 If f £ <S(R 3 ), then 

A = —87 r 2 /. 

We recall that A = ff-? + + S-g is the Laplacian. 

C/X 2 C/X 2 C/tCg 

Proof. By our previous lemma, we have 

/ ' OO 

f{ S1 y™ ts ds. 

-00 

Therefore 

n*n(f){x)= [ l™ f^y-^dsd^i 

J S 2 J — 00 


hence 


A(TZ*IZ{f))(x) = f [ f {s^){—A-k 2 s 2 )e 2mx ' ls ds da(j) 

J S 2 J — 00 


= — 47T 2 
— — 47T 2 


' —00 
r 0 


f(sj)e 2nix ^ s s 2 dsda ( 7 ) 
f(s r y)e 27rix ' ls s 2 dsda ( 7 ) 


— 47 t 2 [ [ f(s r y)e 2mx '' YS s 2 dsda( r y) 

Js 2 Jo 

= — 8n 2 [ [ f(s^)e 2ntx ' ls s 2 ds da("/) 

Js 2 Xo 

= - 8? r 2 /(x). 
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In the first line, we have differentiated under the integral sign and used 
the fact A(e 27r * x ' 7S ) = (— 47T 2 s 2 )e 27r **' 7S , since |7| = 1. The last step fol- 
lows from the formula for polar coordinates in M 3 and the Fourier inver- 
sion theorem. 


5.3 A note about plane waves 

We conclude this chapter by briefly mentioning a nice connection between 
the Radon transform and solutions of the wave equation. This comes 
about in the following way. Recall that when d— 1, the solution of 
the wave equation can be expressed as the sum of traveling waves (see 
Chapter 1), and it is natural to ask if an analogue of such traveling 
waves exists in higher dimensions. The answer is as follows. Let F be 
a function of one variable, which we assume is sufficiently smooth (say 
C 2 ), and consider u{x,t) defined by 

u(x,t) = F((x ■ 7) - t), 

where x € R d and 7 is a unit vector in R d . It is easy to verify directly 
that u is a solution of the wave equation in W l (with c — 1). Such a 
solution is called a plane wave; indeed, notice that u is constant on 
every plane perpendicular to the direction 7, and as time t increases, the 
wave travels in the 7 direction. (It should be remarked that plane waves 
are never functions in <S(IR. d ) when d > 1 because they are constant in 
directions perpendicular to 7). 5 

The basic fact is that when d > 1, the solution of the wave equation 
can be written as an integral (as opposed to sum, when d — 1) of plane 
waves; this can in fact be done via the Radon transform of the initial 
data / and g. For the relevant formulas when d — 3, see Problem 6. 

6 Exercises 

1. Suppose that R is a rotation in the plane R 2 , and let 


denote its matrix with respect to the standard basis vectors e\ = (1,0) and 
e 2 = (0,l). 

(a) Write the conditions R l = R~ x and det(R) = ±1 in terms of equations in 
a, 6, c, d. 


5 Incidentally, this observation is further indication that a fuller treatment of the wave 
equation requires lifting the restriction that functions belong to S(M. d ). 
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(b) Show that there exists ip G R such that a + ib = e lip . 

(c) Conclude that if R is proper, then it can be expressed as £ i — * ze and if 
R is improper, then it takes the form z i— ► ze lv , where z = x — iy. 


2. Suppose that R : R 3 — » R 3 is a proper rotation. 

(a) Show that p(t) = det(i? — tl) is a polynomial of degree 3, and prove that 
there exists 7 G S 2 (where S 2 denotes the unit sphere in R 3 ) with 

R{l) = 7 - 

[Hint: Use the fact that p{ 0) > 0 to see that there is A > 0 with p( A) = 0. 
Then R — XI is singular, so its kernel is non-trivial.] 

(b) If V denotes the plane perpendicular to 7 and passing through the origin, 
show that 


R-.r^r, 

and that this linear map is a rotation. 


3. Recall the formula 


f F{x)dx= j j F(r^)r d 1 drdcr( 7). 

J R d J S 11 - 1 Jo 

Apply this to the special case when F(x) = girffi'y), where x = r'y, to prove 
that for any rotation R , one has 



f(R(l)) do'i'y) = 



f(nf)d<r(i), 


whenever / is a continuous function on the sphere S d 1 


4. Let Ad and Vd denote the area and volume of the unit sphere and unit ball 
in R d , respectively. 

(a) Prove the formula 


27T d/2 

d ~ T(d/2) 


so that A 2 = 27r, A 3 = 47r, A4 = 2i r 2 , .... Here r(x) = / 0 °° e t 
the Gamma function. [Hint: Use polar coordinates and the 
f Rd e~"M 2 dx = l.] 


t 00 - 1 dt is 
fact that 
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(b) Show that dV d = A d , hence 


V d 


7r d / 2 

T(d/2 + 1) ' 


In particular Vp = n, V 3 = 47r/3, . . .. 
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5. Let A be a d x d positive definite symmetric matrix with real coefficients. 
Show that 



ix,A(. x ))dx = (det(^))“ 1/2 . 


This generalizes the fact that f Rd e dx = 1, which corresponds to the case 
where A is the identity. 

[Hint: Apply the spectral theorem to write A = RDR -1 where R is a rotation 
and, D is diagonal with entries Ai, . . . , X d , where { } are the eigenvalues of A.] 


6. Suppose ip G <S(R d ) satisfies f \tp(x )\ 2 dx = 1. Show that 

(Jjtfmt’x) > jjp- 

This is the statement of the Heisenberg uncertainty principle in d dimensions. 


7. Consider the time-dependent heat equation in 
. , du d 2 u d 2 u 

(15) ® = sf + "- + gJi' wheret>0 > 

with boundary values u(x,0) = f(x) G 5(1^). If 

' H t d \x) = - ( T7w^ e “ |x|2/4t = / 

(4tt t ) d / 2 V 

is the d-dimensional heat kernel, show that the convolution 

u(x,t) = (/ *n\ d) )(x) 

is indefinitely differentiable when x G l rf and t > 0. Moreover, u solves (15), and 
is continuous up to the boundary t = 0 with u(x , 0) = f(x). 

The reader may also wish to formulate the d-dimensional analogues of Theo- 
rem 2.1 and 2.3 in Chapter 5. 


8. In Chapter 5, we found that a solution to the steady-state heat equation in the 
upper half-plane with boundary values / is given by the convolution u = / *V y 
where the Poisson kernel is 


V v (x) 


1 y 

7t x 2 T y 2 


where iGt and y > 0. 
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More generally, one can calculate the d-dimensional Poisson kernel using the 
Fourier transform as follows. 


(a) The subordination principle allows one to write expressions involv- 
ing the function e~ x in terms of corresponding expressions involving the 
function e~ x . One form of this is the identity 


e-e = 


-e~P 2 l Au du 


when j3 > 0. Prove this identity with f3 = 27r|rc| by taking the Fourier 
transform of both sides. 

(b) Consider the steady-state heat equation in the upper half-space {{x,y) : 
i£R j , y > 0} 


d 


E 


d 2 u 

dx ) 


+ 


d 2 u 

w 


= o 


with the Dirichlet boundary condition u(x, 0) = /( x). A solution to this 
problem is given by the convolution u(x, y) = (/ * Py d ^)(x) where Py d \x ) 
is the d-dimensional Poisson kernel 

P^ d \ x)= f e 27 rix< e- 2 ^\y d£. 

Jm. d 

Compute Py ^ (x) by using the subordination principle and the d-dimensional 
heat kernel. (See Exercise 7.) Show that 


p(d)( x ) = r (( rf + !)/ 2 ) V 

y W 7T (d+l)/2 (| x |2 +y 2)(d+l)/2- 


9. A spherical wave is a solution u(x, t) of the Cauchy problem for the wave 
equation in which as a function of x is radial. Prove that u is a spherical 
wave if and only if the initial data f,g G S are both radial. 

10. Let u(x, t ) be a solution of the wave equation, and let E(t) denote the energy 
of this wave 


m = 


lR d 


du , . 


d 

+E 

3 = 1 ' 


R d 


du , . 


dx. 


We have seen that E(t) is constant using Plancherel’s formula. Give an alternate 
proof of this fact by differentiating the integral with respect to t and showing 
that 

dE 

— = 0. 
dt 
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[Hint: Integrate by parts.] 

11. Show that the solution of the wave equation 

d 2 u d 2 u d 2 u d 2 u 
dt 2 dx\ + dx% dxl 

subject to u(x,0) = f(x) and ^f(x,0) = g{x), where f,g G <S(R. 3 ), is given by 

u(x, t ) = / ' [ [ tg{y ) + f(y) + Vf{y) ■ ( y - x)] dcr{y), 

1^ I JS( x,t) 

where S(x,t) denotes the sphere of center x and radius t, and |5(a:,t)| its area. 
This is an alternate expression for the solution of the wave equation given in 
Theorem 3.6. It is sometimes called Kirchhoff ’s formula. 

12 . Establish the identity (14) about the dual transform given in the text. In 
other words, prove that 

(16) [ I n{f){t,j)F(t,'y)da('y)dt= f f(x)Tl*{F)(x)dx 

J r Js 2 Jr 3 

where / e <S(R 3 ), F e S(R x S 2 ), and 

K(f)= [ f and TZ*(F)(x)= [ F(x ■ 7, 7) ^(7). 

Jvt n Js 2 

[Hint: Consider the integral 


/ / f{t’y + uie 2 +U 2 e 2 )F(t,'y)dtda('y), 


Integrating first in u gives the left-hand side of (16), while integrating in u and 
t and setting x = t-y + Uie 2 + u 2 e 2 gives the right-hand side.] 


13 . For each (t, 6) with t£l and |0| < 7 r, let L = L t £ denote the line in the 
(x, y)-plane given by 


xcosd + ysind = t. 

This is the line perpendicular to the direction (cos 9, sin 9) at “distance” t from 
the origin (we allow negative t ). For / £ (S(R 2 ) the X-ray transform or two- 
dimensional Radon transform of / is defined by 




f = 


fit cos 9 + u sin 6, t sin 9 — u cos 9) du. 
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Calculate the X-ray transform of the function f{x,y) = e -7r 0 2 +y 2 ). 

14. Let X be the X-ray transform. Show that if / 6 S and X(/) = 0, then 
/ = 0, by taking the Fourier transform in one variable. 

15. For F £ <S(R x S' 1 ), define the dual X-ray transform X*(F) by integrat- 
ing F over all lines that pass through the point (x, y ) (that is, those lines L t 
with x cos 6 + y sin 6 = t) : 

X*(F)(x,y) = / F(x cosd + y sin 9,9) d9. 


Check that in this case, if / £ S(R 2 ) and F £ S(R x S 1 ), then 


J J X(f)(t,0)F(t, 9) dtdO 


f(x,y)X*(F)(x,y)dxdy. 


7 Problems 

1. Let J n denote the n th order Bessel function, for n £ Z. Prove that 

(a) J n (p) is real for all real p. 

(b) J_„(p) = (-1)" J n (p). 

(c) 2 J' n (p) = J n _i(p) - J„+i(p). 

(d) (&) J„(p) = J n _r(p) + J n+1 (p). 

(e) (p- n J n (p)y = -p~ n J n+ i{p). 

(f) (p n J n (p)Y = p"J n _r(p). 

(g) J 7l (p) satishes the second order differential equation 

JY(p) + P~ 1j u(p) + C 1 - n 2 /p 2 )J n (p) = 0. 

(h) Show that 

m = 0 v ’ 

(i) Show that for all integers n and all real numbers a and b we have 

JniP 1 + &) = ^ ^ 
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2. Another formula for J„(p) that allows one to define Bessel functions for 
non-integral values of n, (n > —1/2) is 



(a) Check that the above formula agrees with the definition of J n (p ) for in- 
tegral n > 0. [Hint: Verify it for n = 0 and then check that both sides 
satisfy the recursion formula (e) in Problem 1.] 



(c) 


Prove that 



(d) Observe that the formulas we have proved in the text giving F 0 in terms 
of /o (when describing the Fourier transform of a radial function) take the 
form 



for d = 1,2, and 3, if one uses the formulas above with the understanding 
that J_i/ 2 (p) = lim,,^.]^ Jn(p)- It turns out that the relation between 
Fq and /o given by (17) is valid in all dimensions d. 

3. We observed that the solution u(x,t) of the Cauchy problem for the wave 
equation given by formula (3) depends only on the initial data on the base on 
the backward light cone. It is natural to ask if this property is shared by any 
solution of the wave equation. An affirmative answer would imply uniqueness of 
the solution. 

Let B(xo,ro) denote the closed ball in the hyperplane t = 0 centered at xo 
and of radius r 0 . The backward light cone with base B(xo,ro) is defined by 

£ B(x 0 ,r 0 ) = {fy, ()el' i xR:|i-io|<ro-i, 0 <t< r 0 }. 

Theorem Suppose that u(x, t) is a C 2 function on the closed upper half-plane 
{( x,t ) : x G R d ,t > 0} that solves the wave equation 



If u(x, 0) = Qjf(x,0) = 0 for all x G B(xo,ro), then u(x,t) = 0 for all (x,t) G 
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In words, if the initial data of the Cauchy problem for the wave equation 
vanishes on a ball B, then any solution u of the problem vanishes in the backward 
light cone with base B. The following steps outline a proof of the theorem. 

(a) Assume that u is real. For 0 < t < r 0 let B t (x o, r 0 ) = {a; : \x — rro | < r 0 — 
t}, and also define 


Vu(x,t) = ( Q __ ,..., 


du du du 
dxi ’ ’ dxd ’ dt 


Now consider the energy integral 


m = 2 


Vttr dx 


B t (x 0 ,r 0 ) 


du \ 2 ( du\ 2 

w) + Ebs: *■ 


JB t (x 0 ,r 0 ) \ / j =1 

Observe that E(t) > 0 and -E(O) = 0. Prove that 


*M-/ 

JB t (x o,r 0 ) 0t 0t , = i 


du d 2 u 1 f 

j=i dxjdxidt x 2 J dBt{xoiro) 


Vm | 2 da( 7 ). 


(b) Show that 


d 

dx-j 


du du 
dxj dt 


du d 2 u d 2 u du 
dx.j dxjdt dx 2 dt 


(c) Use the last identity, the divergence theorem, and the fact that u solves 
the wave equation to prove that 

, f du du If , 2 . . 

E (*) = / da (7) “ 2 / |Vw| da(7) ’ 

J dB t (x 0 ,r 0 ) - = 1 OX J 01 - J dB t (x 0 ,r 0 ) 

where Vj denotes the j th coordinate of the outward normal to B t (xo,r 0 ). 

(d) Use the Cauchy-Schwarz inequality to conclude that 


du du 1 . l2 


3=1 


and as a result, E'(t) < 0. Deduce from this that Eft) = 0 and u = 0. 
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4.* There exist formulas for the solution of the Cauchy problem for the wave 
equation 

( j- 7 / (}- 7 / d 2 V Bll 

-QP=Q X 2+-- + Q X 2 With «(*> °) = /(*) alld Qt ( X > °) = S(*) 

in x R in terms of spherical means which generalize the formula given in the 
text for 4 = 3. In fact, the solution for even dimensions is deduced from that for 
odd dimensions, so we discuss this case first. 

Suppose that d > 1 is odd and let h £ 5(R d ). The spherical mean of h on the 
ball centered at x of radius t is defined by 


M r h{ x) = Mh{x , r) = -P / /i(a; — ry) da (7), 

Ad Jcd-i 


s d ~ 

where denotes the area of the unit sphere S^ -1 in 
(a) Show that 

d- 1 


A x Mh(x, r ) = 




Mh(x, r), 


where A x denotes the Laplacian in the space variables x, and d r = d/dr. 

(b) Show that a twice differentiable function u(x, t) satisfies the wave equation 
if and only if 



d- 1 
r 


Mu(x , r, t ) 


d/Mu( x, r, t), 


where Mu(x,r,t) denote the spherical means of the function u{x,t). 

(c) If d = 2 k + 1, define Tip(r) = (r~ 1 d r ) k ~ 1 [r 2k ~ l ip{r)] 1 and let u = TMu. 
Then this function solves the one-dimensional wave equation for each fixed 
x: 


d/u(x, r, t) = d/u(x , r, t). 

One can then use d’Alembert’s formula to find the solution u(x, r, t) of 
this problem expressed in terms of the initial data. 

(d) Now show that 


u(x,t) = Mu(x, 0, t) = lim 

7 >0 


u(x, r, t) 
ar 


where a = 1 • 3 • • • (d — 2). 
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(e) Conclude that the solution of the Cauchy problem for the d-dimensional 
wave equation, when d > 1 is odd, is 

«(M) = 1 , 3 ,. 1 (rf _ 2) (t d ~ 2 M t f(x)) + 


(f 1 ^- 3 )/ 2 (i t d ~ 2 M t g(x ))] . 


5.* The method of descent can be used to prove that the solution of the Cauchy 
problem for the wave equation in the case when d is even is given by the formula 


u(x,t) = 


1 

1 • 3 • • • (d — 2) 






where M t denotes the modified spherical means defined by 


M t h(x) = 


f{x + ty) 


W 1 JB d \J\ - 


dy. 


6.* Given initial data / and g of the form 

f(x) = F(x ■ 7) and g(x) = G(x ■ 7), 
check that the plane wave given by 


u(x,t) = 


F(x ■ 7 + t) + F(x ■ 7 — t) 1 


f'X-'y+t 


+ 2 


G(s) ds 


x-'y—t 


is a solution of the Cauchy problem for the d-dimensional wave equation. 

In general, the solution is given as a superposition of plane waves. For the 
case d = 3, this can be expressed in terms of the Radon transform as follows. 
Let 


^(/)(£, 7) = - 


1 


87 r 2 V dt 


d 


— 7). 


Then u(x , t ) = 

7 : / ^(/)(z-7-b7)+77(/)(>-7 + b7) + 


s 2 


fX-'y+t 


^•(S , )( s ; 7) d S 


• x-'y—t 


d<r(7). 
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7. For every real number a > 0, define the operator (— A)“ by the formula 



whenever / 6 <S(R d ). 

(a) Check that (— A) a agrees with the usual definition of the a th power of 
—A (that is, a compositions of minus the Laplacian) when a is a positive 
integer. 

(b) Verily that (—A )“(/) is indefinitely differentiable. 

(c) Prove that if a is not an integer, then in general (—A )“(/) is not rapidly 
decreasing. 

(d) Let u(x, y) be the solution of the steady-state heat equation 



given by convolving / with the Poisson kernel (see Exercise 8). Check 
that 


(— A) 1/2 /(:r) = - \\m —(x,y), 
v^o ay 


and more generally that 



for any positive integer k. 


8.* The reconstruction formula for the Radon transform in R d is as follows: 
(a) When d = 2, 



where (—A) 1 ' 2 is defined in Problem 7. 

(b) If the Radon transform and its dual are defined by analogy to the cases 
d = 2 and d = 3, then for general d, 



2 


(— A ) ( d -i)/ 2 7 7*(77 (/)) = /. 
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Finite Fourier Analysis 


This past year has seen the birth, or rather the re- 
birth, of an exciting revolution in computing Fourier 
transforms. A class of algorithms known as the fast 
Fourier transform or FFT, is forcing a complete re- 
assessment of many computational paths, not only in 
frequency analysis, but in any fields where problems 
can be reduced to Fourier transforms and/or convolu- 
tions... 

C. Bingham and J. W. Tukey, 1966 


In the previous chapters we studied the Fourier series of functions on 
the circle and the Fourier transform of functions defined on the Euclidean 
space R d . The goal here is to introduce another version of Fourier analy- 
sis, now for functions defined on finite sets, and more precisely, on finite 
abelian groups. This theory is particularly elegant and simple since infi- 
nite sums and integrals are replaced by finite sums, and thus questions 
of convergence disappear. 

In turning our attention to finite Fourier analysis, we begin with the 
simplest example, Z(IV), where the underlying space is the (multiplica- 
tive) group of IV th roots of unity on the circle. This group can also be 
realized in additive form, as Z/IVZ, the equivalence classes of integers 
modulo N. The group Z (N) arises as the natural approximation to the 
circle (as N tends to infinity) since in the first picture the points of Z (IV) 
correspond to N points on the circle which are uniformly distributed. For 
this reason, in practical applications, the group Z(iV) becomes a natural 
candidate for the storage of information of a function on the circle, and 
for the resulting numerical computations involving Fourier series. The 
situation is particularly nice when N is large and of the form N — 2 n . 
The computations of the Fourier coefficients now lead to the “fast Fourier 
transform,” which exploits the fact that an induction in n requires only 
about log N steps to go from N = 1 to N — 2 n . This yields a substantial 
saving in time in practical applications. 

In the second part of the chapter we undertake the more general the- 
ory of Fourier analysis on finite abelian groups. Here the fundamental 
example is the multiplicative group Z *(q). The Fourier inversion formula 
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for Z*(g) will be seen to be a key step in the proof of Dirichlet’s theorem 
on primes in arithmetic progression, which we will take up in the next 
chapter. 

1 Fourier analysis on Z (TV) 

We turn to the group of N th roots of unity. This group arises naturally as 
the simplest finite abelian group. It also gives a uniform partition of the 
circle, and is therefore a good choice if one wishes to sample appropriate 
functions on the circle. Moreover, this partition gets finer as N tends to 
infinity, and one might expect that the discrete Fourier theory that we 
discuss here tends to the continuous theory of Fourier series on the circle. 
In a broad sense, this is the case, although this aspect of the problem is 
not one that we develop. 


1.1 The group Z(IV) 

Let N be a positive integer. A complex number z is an N th root of 
unity if z N — 1. The set of IV th roots of unity is precisely 



Indeed, suppose that z N = 1 with z — re ld . Then we must have r N e lNd = 
1, and taking absolute values yields r — 1 . Therefore e lNe = 1, and this 
means that NO = 2nk where k G Z. So if £ = e 2m / N we find that 
exhausts all the N th roots of unity. However, notice that ( N — 1 so if 
n and m differ by an integer multiple of N, then j n fact, it is 

clear that 


C, n = if and only if n — m is divisible by N. 


We denote the set of all N th roots of unity by Z(N). The fact that 
this set gives a uniform partition of the circle is clear from its definition. 
Note that the set Z (N) satisfies the following properties: 

(i) If z, w G Z(IV), then zw € Z (N) and zw = wz. 

(ii) 1 e Z(iV). 

(iii) If z € Z (IV), then z _1 = l/z£ Z(AT) and of course zz -1 = 1. 

As a result we can conclude that Z (N) is an abelian group under complex 
multiplication. The appropriate definitions are set out in detail later in 
Section 2.1. 
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Z{ 9), C = e 2ni/9 Z(N), N = 2 6 

Figure 1. The group of ./V th roots of unity when N — 9 and N — 2 6 — 
64 


There is another way to visualize the group Z(iV). This consists of 
choosing the integer power of £ that determines each root of unity. We 
observed above that this integer is not unique since whenever n 

and m differ by an integer multiple of N. Naturally, we might select the 
integer which satisfies 0 < n < N — 1. Although this choice is perfectly 
reasonable in terms of “sets,” we ask what happens when we multiply 
roots of unity. Clearly, we must add the corresponding integers since 
C n C m _ ^ n+m no thing guarantees that 0<n + m<N — 1. In fact, 
if C i n C > rn — with 0 < k < N — 1, then n + m and k differ by an integer 
multiple of N. So, to find the integer in [0, N — 1] corresponding to the 
root of unity ( ' n ( m , we see that after adding the integers n and m we 
must reduce modulo N, that is, find the unique integer 0 < k < N — 1 
so that (n + m) — k is an integer multiple of N . 

An equivalent approach is to associate to each root of unity oj the 
class of integers n so that ( n = u>. Doing so for each root of unity we 
obtain a partition of the integers in N disjoint infinite classes. To add 
two of these classes, choose any integer in each one of them, say n and 
m, respectively, and define the sum of the classes to be the class which 
contains the integer n + m. 

We formalize the above notions. Two integers x and y are congru- 
ent modulo N if the difference x — y is divisible by N, and we write 
x = y mod N . In other words, this means that x and y differ by an 
integer multiple of N. It is an easy exercise to check the following three 
properties: 
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• x = x mod N for all integers x. 

• If x = y mod N, then y = x mod N. 

• If x = y mod N and y = z mod N, then x = z mod N. 

The above defines an equivalence relation on Z. Let R(x) denote the 
equivalence class, or residue class, of the integer x. Any integer of the 
form x + kN with k E Z is an element (or “representative”) of R(x). 
In fact, there are precisely N equivalence classes, and each class has a 
unique representative between 0 and N — 1. We may now add equiva- 
lence classes by defining 

R(x) + R(y) = R(x + y ). 

This definition is of course independent of the representatives x and y 
because if x' € R(x) and y' E R(y), then one checks easily that x' + y' € 
R{x + y). This turns the set of equivalence classes into an abelian group 
called the group of integers modulo N, which is sometimes denoted 
by Z/iVZ. The association 

R(k) < — * e ^/N 

gives a correspondence between the two abelian groups, Z/iVZ and Z (N). 
Since the operations are respected, in the sense that addition of inte- 
gers modulo N becomes multiplication of complex numbers, we shall 
also denote the group of integers modulo N by Z (N). Observe that 
0 € Z/IVZ corresponds to 1 on the unit circle. 

Let V and W denote the vector spaces of complex- valued functions on 
the group of integers modulo N and the IV th roots of unity, respectively. 
Then, the identification given above carries over to V and W as follows: 

F(k) < — > f(e 2nik/N ), 

where F is a function on the integers modulo N and / is a function on 
the IV th roots of unity. 

From now on, we write Z (N) but think of either the group of integers 
modulo N or the group of IV th roots of unity. 

1.2 Fourier inversion theorem and Plancherel identity on Z (IV) 

The first and most crucial step in developing Fourier analysis on Z (IV) is 
to find the functions which correspond to the exponentials e n (x) = e 2n ' inx 
in the case of the circle. Some important properties of these exponentials 


are: 
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(i) {e n } n e z is an orthonormal set for the inner product (1) (in Chap- 
ter 3) on the space of Riemann integrable functions on the circle. 

(ii) Finite linear combinations of the e n ’s (the trigonometric polyno- 
mials) are dense in the space of continuous functions on the circle. 

(hi) e n (x + y) = e n (x)e n (y). 

On Z(7V), the appropriate analogues are the N functions eo, . . . , ejv-i 
defined by 

e t [k) = C ek = e 2nUk/N for t = 0, . . . , N - 1 and k = 0, . . . , N - 1, 

where ( — e 27rl / N . To understand the parallel with (i) and (ii), we can 
think of the complex- valued functions on Z (AT) as a vector space V, 
endowed with the Hermitian inner product 


TV- 1 

(F,G) = Y,mW) 


k = 0 


and associated norm 

TV-1 

M 2 = E l F ( fc )l 2 - 

fc =0 

Lemma 1.1 The family {eo, . . . , e^-i} is orthogonal. In fact, 


(^m? &t) 


N if m = l, 
0 if m l. 


Proof. We have 


TV-1 TV-1 

^) = E c mfc c' fe = E c ( 

k = 0 k = 0 


m—t)k 


If m = i, each term in the sum is equal to 1, and the sum equals N . If 
m ^ I , then q = is not equal to 1 , and the usual formula 

1 - n N 

1 +q + q 2 + --- + q N ~' = 7 

1 - q 

shows that (e m ,e^) = 0, because q N = 1. 

Since the N functions eo, ■ ■ . , ejy-i are orthogonal, they must be lin- 
early independent, and since the vector space V is TV-dimensional, we 
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conclude that {eo, . . . , ejv-i} is an orthogonal basis for V. Clearly, prop- 
erty (iii) also holds, that is, eg(k + m) = ee(k)ee(m) for all £, and all 
k, m £ Z(iV). 

By the lemma each vector ei has norm VN, so if we define 

e ‘ = VN e " 

then {eg, . . . , e* N _ l } is an orthonormal basis for V. Hence for any F £ V 
we have 

N - 1 JV-1 

(1) F — E ( F , e n) e n as well as ||F|| 2 = E \(F, e ;)| 2 . 

71 = 0 71 = 0 

If we define the n th Fourier coefficient of F by 

N - 1 

^ E me~ 2nikn/N , 

k = 0 

the above observations give the following fundamental theorem which is 
the Z (TV) version of the Fourier inversion and the Parseval-Plancherel 
formulas. 

Theorem 1.2 If F is a function onZ(N), then 


N - 1 

m = E o n e ^ 


n = 0 


Moreover, 


N - 1 . N - 1 

E M 2 = jf E \ F ( k )\ 2 - 

n = 0 k = 0 

The proof follows directly from (1) once we observe that 
~ TV ^ = 71 ^ e ^' 

Remark. It is possible to recover the Fourier inversion on the circle 
for sufficiently smooth functions (say C 2 ) by letting N — > oo in the finite 
model Z (AT) (see Exercise 3). 
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1.3 The fast Fourier transform 

The fast Fourier transform is a method that was developed as a means 
of calculating efficiently the Fourier coefficients of a function F on Z (IV). 

The problem, which arises naturally in numerical analysis, is to deter- 
mine an algorithm that minimizes the amount of time it takes a computer 
to calculate the Fourier coefficients of a given function on Z(IV). Since 
this amount of time is roughly proportional to the number of operations 
the computer must perform, our problem becomes that of minimizing 
the number of operations necessary to obtain all the Fourier coefficients 
{a n } given the values of F on Z (IV). By operations we mean either an 
addition or a multiplication of complex numbers. 

We begin with a naive approach to the problem. Fix IV, and suppose 
that we are given F(0), . . . , F(N — 1) and — e ~ 2m / N . If we denote 
by (F) the k th Fourier coefficient of F on Z(iV), then by definition 

N - 1 

a^(F) = ^ E F ^n, 

r = 0 

and crude estimates show that the number of operations needed to cal- 
culate all Fourier coefficients is < 2 N 2 + N. Indeed, it takes at most 
N — 2 multiplications to determine . . . , and each coefficient 

a requires N + 1 multiplications and N — 1 additions. 

We now present the fast Fourier transform, an algorithm that im- 
proves the bound 0(N 2 ) obtained above. Such an improvement is possi- 
ble if, for example, we restrict ourselves to the case where the partition 
of the circle is dyadic, that is, N — 2 n . (See also Exercise 9.) 

Theorem 1.3 Given u>n = e~ 2nl / N with N = 2 n , it is possible to calcu- 
late the Fourier coefficients of a function on Z(IV) with at most 

4 ■ 2 n n = 4IVlog 2 (IV) = Q(N log IV) 


operations. 

The proof of the theorem consists of using the calculations for M 
division points, to obtain the Fourier coefficients for 2 M division points. 
Since we choose N = 2 n , we obtain the desired formula as a consequence 
of a recurrence which involves n = OifogN) steps. 

Let #(M) denote the minimum number of operations needed to cal- 
culate all the Fourier coefficients of any function on Z(M). The key to 
the proof of the theorem is contained in the following recursion step. 
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Lemma 1.4 If we are given u> 2 M = e 2 W( 2M f then 

#(2 M) < 2 #(M) + 8M. 

Proof. The calculation of 0 J 2 m-, ■ ■ ■ > w 2 M requires no more than 2M 
operations. Note that in particular we get ujm — e^ 2npM — w\ M . The 
main idea is that for any given function F on Z(2 M), we consider two 
functions F 0 and F- { on Z(M) defined by 

F 0 (r) = F(2r) and Ffr) = F(2r +1). 

We assume that it is possible to calculate the Fourier coefficients of F 0 
and F\ in no more than ff(M) operations each. If we denote the Fourier 
coefficients corresponding to the groups Z(2 M) and Z(M) by a\ M and 
ajf , respectively, then we have 

al M (F) = ^ (af (F 0 ) + af (F\)u; 2 fe M ) . 

To prove this, we sum over odd and even integers in the definition of the 
Fourier coefficient a\ M (F), and find 

.. 2M-1 

al M ( p) = Hr)*™ 

r = 0 

1 M — l 1 M—l \ 

= sE nwJaP 4e n 2 m+ iH‘r +1) 

V 1=0 m = 0 / 

. / -. M—l . M—l \ 

= 2 (t7 E + T7 E , 

V £=0 m = 0 / 

which establishes our assertion. 

As a result, knowing af (F 0 ), af (Fi), and we see that each 

af, M (F) can be computed using no more than three operations (one ad- 
dition and two multiplications). So 

#(2 M) < 2 M + 2 #(M) + 3x2 M = 2 #(M) + 8 Af, 

and the proof of the lemma is complete. 

An induction on n, where N = 2 n , will conclude the proof of the the- 
orem. The initial step n — 1 is easy, since N — 2 and the two Fourier 
coefficients are 

<${F) = \ (F( 1) + F(- 1)) and af (F) = i (F(l) + (-l)F(-l)) . 
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Calculating these Fourier coefficients requires no more than five opera- 
tions, which is less than 4x2 = 8. Suppose the theorem is true up to 
N = 2 n ~ 1 so that #(-/V) < 4 ■ 2 n-1 (n — 1). By the lemma we must have 

#(2 N) < 2 • 4 • 2 n ~ l (n - 1) + 8 ■ 2 n_1 = 4 ■ 2 n n, 

which concludes the inductive step and the proof of the theorem. 

2 Fourier analysis on finite abelian groups 

The main goal in the rest of this chapter is to generalize the results about 
Fourier series expansions obtained in the special case of Z (N). 

After a brief introduction to some notions related to finite abelian 
groups, we turn to the important concept of a character. In our set- 
ting, we find that characters play the same role as the exponentials 
eo,...,ejv_i on the group Z (IV), and thus provide the key ingredient 
in the development of the theory on arbitrary finite abelian groups. In 
fact, it suffices to prove that a finite abelian group has “enough” charac- 
ters, and this leads automatically to the desired Fourier theory. 

2.1 Abelian groups 

An abelian group (or commutative group) is a set G together with a 
binary operation on pairs of elements of G , (a, b) i— > a ■ b, that satisfies 
the following conditions: 

(i) Associativity: a ■ (6 ■ c) = (a ■ b) ■ c for all a,b,c £ G. 

(ii) Identity : There exists an element u £ G (often written as either 1 
or 0) such that a ■ u — u ■ a — a for all a £ G. 

(iii) Inverses: For every a £ G, there exists an element a^ 1 £ G such 
that a ■ a = a -1 ■ a = u. 

(iv) Commutativity: For a,b £ G, we have a ■ b = b ■ a. 

We leave as simple verifications the facts that the identity element and 
inverses are unique. 

Warning. In the definition of an abelian group, we used the “multi- 
plicative” notation for the operation in G. Sometimes, one uses the “ad- 
ditive” notation a + b and —a, instead of a ■ b and a -1 . There are times 
when one notation may be more appropriate than the other, and the 
examples below illustrate this point. The same group may have different 
interpretations, one where the multiplicative notation is more suggestive, 
and another where it is natural to view the group with addition, as the 
operation. 
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Examples of abelian groups 

• The set of real numbers M with the usual addition. The identity is 
0 and the inverse of x is — x. 

Also, M — {0} and M + = {x € IR. : x > 0} equipped, with the stan- 
dard multiplication, are abelian groups. In both cases the unit is 1 
and the inverse of x is 1/x. 

• With the usual addition, the set of integers Z is an abelian group. 
However, Z — {0} is not an abelian group with the standard mul- 
tiplication, since, for example, 2 does not have a multiplicative 
inverse in Z. In contrast, Q — {0} is an abelian group with the 
standard multiplication. 

• The unit circle S 1 in the complex plane. If we view the circle as 
the set of points {e ld : 9 € IR}, the group operation is the standard 
multiplication of complex numbers. However, if we identify points 
on S 1 with their angle 9, then S 1 becomes M modulo 2-7T, where the 
operation is addition modulo 27r. 

• Z (TV) is an abelian group. Viewed as the V th roots of unity on the 
circle, Z (N) is a group under multiplication of complex numbers. 
However, if Z(iV) is interpreted as Z/IVZ, the integers modulo N, 
then it is an abelian group where the operation is addition modulo 
N. 

• The last example consists of Z *(q). This group is defined as the set 
of all integers modulo q that have multiplicative inverses, with the 
group operation being multiplication modulo q. This important 
example is discussed in more detail below. 

A homomorphism between two abelian groups G and H is a map 
/ : G — > H which satisfies the property 

/(a ■ b ) = /(a) ■ f(b), 

where the dot on the left-hand side is the operation in G , and the dot 
on the right-hand side the operation in H. 

We say that two groups G and H are isomorphic, and write G ~ //. 
if there is a bijective homomorphism from G to H. Equivalently, G and 
H are isomorphic if there exists another homomorphism / : H — > G. so 
that for all a £ G and b G H 


(/ ° /)( a ) — a and (/o/)(6) = 6. 
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Roughly speaking, isomorphic groups describe the “same” object because 
they have the same underlying group structure (which is really all that 
matters); however, their particular notational representations might be 
different. 

Example 1. A pair of isomorphic abelian groups arose already when 
we considered the group Z(iV). In one representation it was given as 
the multiplicative group of N th roots of unity in C. In a second repre- 
sentation it was the additive group Z/AZ of residue classes of integers 
modulo N. The mapping n i— > R(n), which associates to a root of unity 
2 = e 2 mn / N — Qn j-egidue class in Z/AIZ determined by n, provides 
an isomorphism between the two different representations. 

Example 2. In parallel with the previous example, we see that the circle 
(with multiplication) is isomorphic to the real numbers modulo 27r (with 
addition) . 

Example 3. The properties of the exponential and logarithm guarantee 
that 

exp : R — > R + and log : R + — > R 

are two homomorphisms that are inverses of each other. Thus R (with 
addition) and R + (with multiplication) are isomorphic. 

In what follows, we are primarily interested in abelian groups that are 
finite. In this case, we denote by |G| the number of elements in G, and 
call |G| the order of the group. For example, the order of Z (IV) is N. 

A few additional remarks are in order: 

• If G i and G '2 are two finite abelian groups, their direct product 
G\ x G 2 is the group whose elements are pairs (< 7 i, 92 ) with g\ € G\ 
and 52 € G 2 . The operation in G\ x G 2 is then defined by 

(fibi 92 ) ' ( 9i > 92 ) = ( 9i ' 9it 92 ' 92 )- 

Clearly, if G 1 and G 2 are finite abelian groups, then so is Gi x G 2 . 
The definition of direct product generalizes immediately to the case 
of finitely many factors G\ x Gj x ■ ■ • x G n - 

• The structure theorem for finite abelian groups states that such a 
group is isomorphic to a direct product of groups of the type Z {N)\ 
see Problem 2. This is a nice result which gives us an overview of 
the class of all finite abelian groups. However, since we shall not 
use this theorem below, we omit its proof. 
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We now discuss briefly the examples of abelian groups that play a 
central role in the proof of Dirichlet’s theorem in the next chapter. 

The group Z*(g) 

Let q be a positive integer. We see that multiplication in Z (q) can be 
unambiguously defined, because if n is congruent to n! and m is congruent 
to m! (both modulo q ), then nm is congruent to n'm i modulo q. An 
integer n € Z (q) is a unit if there exists an integer m 6 Z (q) so that 

nm = 1 mod q. 

The set of all units in Z (q) is denoted by Z *(g), and it is clear from our 
definition that Z *(q) is an abelian group under multiplication modulo q. 
Thus within the additive group Z (q) lies a set Z *(q) that is a group under 
multiplication. An alternative characterization of Z *(q) will be given in 
the next chapter, as those elements in Z (q) that are relatively prime to q. 

Example 4. The group of units in Z(4) = {0, 1, 2, 3} is 

Z*(4) = {1,3}. 

This reflects the fact that odd integers are divided into two classes de- 
pending on whether they are of the form 4fc + 1 or 4fc + 3. In fact, Z*(4) 
is isomorphic to Z(2). Indeed, we can make the following association: 

Z*(4) Z(2) 

1 4 — > 0 

3 ^ — > 1 

and then notice that multiplication in Z*(4) corresponds to addition in 
Z(2). 

Example 5. The units in Z(5) are 

Z*(5) = {1,2,3,4}. 

Moreover, Z*(5) is isomorphic to Z(4) with the following identification: 
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Example 6. The units in Z(8) = {0, 1,2,3, 4, 5, 6, 7} are given by 

Z*(8) = {1,3,5,7}. 


In fact, Z*(8) is isomorphic to the direct product Z(2) x Z(2). In this 
case, an isomorphism between the groups is given by the identification 


Z*(8) 

1 

3 

5 

7 


Z(2) x Z(2) 
( 0 , 0 ) 
( 1 . 0 ) 
( 0 , 1 ) 
( 1 , 1 ) 


2.2 Characters 

Let G be a finite abelian group (with the multiplicative notation) and 
S 1 the unit circle in the complex plane. A character on G is a complex- 
valued function e : G — > S 1 which satisfies the following condition: 

(2) e(a ■ b) = e(a)e(b) for all a,b E G. 

In other words, a character is a homomorphism from G to the circle 
group. The trivial or unit character is defined by e(a) = 1 for all 
a <E G. 

Characters play an important role in the context of finite Fourier se- 
ries, primarily because the multiplicative property (2) generalizes the 
analogous identity for the exponential functions on the circle and the 
law 


ee(k + m) — e £ (k)e e (m), 

which held for the exponentials eo, ■ . . , ejv-i used in the Fourier theory 
on Z (N). There we had eg(k) = C ik — e 2mek / N , with 0 < t < N — 1 and 
k € Z (IV), and in fact, the functions eo, . .. , ejv-i are precisely all the 
characters of the group Z (N). 

If G is a finite abelian group, we denote by G the set of all characters 
of G, and observe next that this set inherits the structure of an abelian 
group. 
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Lemma 2.1 The set G is an abelian group under multiplication defined 
by 

(ei ■ e 2 )(a) = ei(a)e 2 (a) for all a £ G. 

The proof of this assertion is straightforward if one observes that the 
trivial character plays the role of the unit. We call G the dual group 
of G. 

In light of the above analogy between characters for a general abelian 
group and the exponentials on Z(IV), we gather several more examples 
of groups and their duals. This provides further evidence of the central 
role played by characters. (See Exercises 4, 5, and 6.) 

Example 1. If G — Z(iV), all characters of G take the form ee(k) — f (k — 
e 2 mik/N f or some 0 < i < N — 1 , and it is easy to check that ei^> l gives 
an isomorphism from Z (IV) to Z (N). 

Example 2. The dual group of the circle 1 is precisely {e n } n€ z (where 
e n (x) — e 2mnx ). Moreover, e n h- > n gives an isomorphism between S 1 
and the integers Z. 

Example 3. Characters on R are described by 

e c (x) - e 2 * ^ x where (el. 

Thus i — t f is an isomorphism from R to R. 

Example 4. Since exp : R — > M + is an isomorphism, we deduce from the 
previous example that the characters on R + are given by 

eg(x) = x 2ni ^ — e 2ni £ lo g x where (el, 

and R+ is isomorphic to R (or R + ). 

The following lemma says that a nowhere vanishing multiplicative 
function is a character, a result that will be useful later. 

Lemma 2.2 Let G be a finite abelian group, and e : G — > C — {0} a mul- 
tiplicative function, namely e(a ■ b) — e(a)e(b) for all a,b £ G. Then e is 
a character. 


1 In addition to (2), the definition of a character on an infinite abelian group requires 

continuity. When G is the circle, R, or R + , the meaning of “continuous” refers to the 

standard notion of limit. 
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Proof. The group G being finite, the absolute value of e(a) is bounded 
above and below as a ranges over G. Since \e{b n )\ = \e(b)\ n , we conclude 
that |e(6)| — 1 for all b e G. 

The next step is to verify that the characters form an orthonormal 
basis of the vector space V of functions over the group G. This fact 
was obtained directly in the special case G = Z (TV) from the explicit 
description of the characters eo, • . . , ejv-i- 

In the general case, we begin with the orthogonality relations; then we 
prove that there are “enough” characters by showing that there are as 
many as the order of the group. 

2.3 The orthogonality relations 

Let V denote the vector space of complex- valued functions defined on the 
finite abelian group G. Note that the dimension of V is |G|, the order of 
G. We define a Hermitian inner product on V by 

(3) (f,g) = tL f( a ) 9 (a), whenever f,g G V. 

1^1 aeG 

Here the sum is taken over the group and is therefore finite. 

Theorem 2.3 The characters of G form an orthonormal family with 
respect to the inner product defined above. 

Since |e(a)| = 1 for any character, we find that 

( e > e ) = jir 5Z e («) e ( a ) i e (°oi 2 = L 

aeG 1^1 aeG 

If e^e' and both are characters, we must prove that (e, e') — 0; we 
isolate the key step in a lemma. 

Lemma 2.4 If e is a non-trivial character of the group G, then 
E aeG e(a) - 0. 

Proof. Choose b € G such that e(b) ^ 1. Then we have 

e{b) e («) = H e(b)e{a) = ^ e(ab) = ^ e(a). 

aEG clEG clEG clEG 

The last equality follows because as a ranges over the group, ab ranges 
over G as well. Therefore JfaeG e (°0 = 0- 
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We can now conclude the proof of the theorem. Suppose e! is a char- 
acter distinct from e. Because e(e')~ 1 is non- trivial, the lemma implies 
that 

e ( a )( e '( a )) _1 = °- 

Since (e'(a))^ 1 = e'(a), the theorem is proved. 

As a consequence of the theorem, we see that distinct characters are 
linearly independent. Since the dimension of V over C is |G|, we conclude 
that the order of G is finite and < G | . The main result to which we now 
turn is that, in fact, |G| = |G|. 

2.4 Characters as a total family 

The following completes the analogy between characters and the complex 
exponentials. 

Theorem 2.5 The characters of a finite abelian group G form a basis 
for the vector space of functions on G. 

There are several proofs of this theorem. One consists of using the 
structure theorem for finite abelian groups we have mentioned earlier, 
which states that any such group is the direct product of cyclic groups, 
that is, groups of the type Z (N). Since cyclic groups are self-dual, using 
this fact we would conclude that |G| = |G|, and therefore the characters 
form a basis for G. (See Problem 3.) 

Here we shall prove the theorem directly without these considerations. 

Suppose V is a vector space of dimension d with inner product (■,•). 

A linear transformation T : V — » V is unitary if it preserves the inner 
product, ( Tv,Tw ) — (v,w) for all v,w £ V. The spectral theorem from 
linear algebra asserts that any unitary transformation is diagonalizable. 
In other words, there exists a basis {iq, . . . ,Vd} (eigenvectors) of V such 
that T(vi) — A iVi, where Xi G C is the eigenvalue attached to Vi. 

The proof of Theorem 2.5 is based on the following extension of the 
spectral theorem. 

Lemma 2.6 Suppose {Ti , . . . , Tjf\ is a commuting family of unitary trans- 
formations on the finite- dimensional inner product space V; that is, 

TiTj = TjTi for all i,j. 

Then Ti, . . . , T^ are simultaneously diagonalizable. In other words, there 
exists a basis for V which consists of eigenvectors for every 
Ti, i — 1, . . . ,k. 
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Proof. We use induction on k. The case k = 1 is simply the spec- 
tral theorem. Suppose that the lemma is true for any family of k — 1 
commuting unitary transformations. The spectral theorem applied to Tk 
says that V is the direct sum of its eigenspaces 

V = V Xl ®---®V Xg , 

where V Xi denotes the subspace of all eigenvectors with eigenvalue A 
We claim that each one of the Ti, . . . , Tk - 1 maps each eigenspace V Xi to 
itself. Indeed, if v E V Xi and 1 < j < k — 1, then 


T k Tj(v) = TjT k {y) = Tj(Xiv) = XfTj^v) 

so Tj(y) E V x . , and the claim is proved. 

Since the restrictions to V Xi of Tf, . . . , T k ~ i form a family of commut- 
ing unitary linear transformations, the induction hypothesis guarantees 
that these are simultaneously diagonalizable on each subspace V Xi . This 
diagonalization provides us with the desired basis for each V x . , and thus 
for V. 

We can now prove Theorem 2.5. Recall that the vector space V of 
complex- valued functions defined on G has dimension \G\ . For each 
a E G we define a linear transformation T a : V — » V by 

(■ T a f){x ) = f(a ■ x ) for x E G. 

Since G is abelian it is clear that T a Tb = T&T Q for all a,b E G, and one 
checks easily that T a is unitary for the Hermitian inner product (3) de- 
fined on V . By Lemma 2.6 the family {T a } a ^G is simultaneously di- 
agonals able. This means there is a basis {uh(a:)}6gG for V such that 
each Vb(x) is an eigenfunction for T a , for every a. Let v be one of these 
basis elements and 1 the unit element in G. We must have u(l) ^ 0 for 
otherwise 


v(a) = v(a ■ 1) = (T a v)( 1) = A a u(l) = 0, 


where A a is the eigenvalue of v for T a . Hence v = 0, and this is a contra- 
diction. We claim that the function defined by w(x) = X x = v(x)/v(l) 
is a character of G. Arguing as above we find that w(x) ^ 0 for every x, 
and 


w(a ■ b) 


v(a ■ b) 

«( 1 ) 


XgVjb) 

«( 1 ) 


Aa A b 


u(l) 

u(l) 


A a A b = w(a)w(b). 


We now invoke Lemma 2.2 to conclude the proof. 
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2.5 Fourier inversion and Plancherel formula 

We now put together the results obtained in the previous sections to 
discuss the Fourier expansion of a function on a finite abelian group G. 
Given a function f on G and character e of G, we define the Fourier 
coefficient of / with respect to e, by 

/(e) = (/,e) = /(«) e («)> 

aeG 


and the Fourier series of / as 

e££r 

Since the characters form a basis, we know that 

f =Y, c * e 

b£G 

for some set of constants c e . By the orthogonality relations satisfied by 
the characters, we find that 


(/, e) = c e , 

so / is indeed equal to its Fourier series, namely, 

/ = XJ /( e ) e - 

e£(jr 


We summarize our results. 

Theorem 2.7 Let G be a finite abelian group. The characters of G form 
an orthonormal basis for the vector space V of functions on G equipped 
with the inner product 


( f,9 ) = t 4 t f( a )9{a)- 

aeG 

In particular, any function f on G is equal to its Fourier series 

f = Yl 

e££r 

Finally, we have the Parseval-Plancherel formula for finite abelian 
groups. 
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Theorem 2.8 If f is a function on G, then ||/|| 2 3 = ^ |/(e)| 2 . 

e££r 

Proof Since the characters of G form an orthonormal basis for the 
vector space V, and (/, e) = /(e), we have that 

ll/ll 2 = (/,/) - ^(/,e)7(e) = ^ |/(e)| 2 . 

e£G e£Gr 


The apparent difference of this statement with that of Theorem 1.2 
is due to the different normalizations of the Fourier coefficients that are 
used. 

3 Exercises 

1. Let / be a function on the circle. For each N > 1 the discrete Fourier 
coefficients of / are defined by 

1 N 

a N (n) = — f(e 2 ™ k / N )e- 2nlkn/N , for n£ Z. 

k = 1 


We also let 

a(n) = [ f{e 2 ™) e - 2 ™™ dx 
Jo 

denote the ordinary Fourier coefficients of /. 

(a) Show that ajv(n) = aj^(n + N). 

(b) Prove that if / is continuous, then ajv(n) —■ > a(n) as N — » oo. 


2. If / is a C 1 function on the circle, prove that |ajv(n)| < c/|n| whenever 
0 < |n| < N/2. 

[Hint: Write 


ajv(n)[l - e 2%Un/N ] = — ^[/(e 27rlfc/JV ) - f( e 2 ™(k+e)/N^ e - 2 nikn/N, 


k=l 


and choose I so that In/N is nearly 1/2.] 


3. By a similar method, show that if / is a C 2 function on the circle, then 
|flvr(n)| < c/|n| 2 , whenever 0 < |n| < N/2. 
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As a result, prove the inversion formula for / G C 2 , 

oo 

f(e 2nix ) = Y a{n)e 2ninx 

n =— oo 


from its finite version. 

[Hint: For the first part, use the second symmetric difference 

/( 

For the second part, if N is odd (say), write the inversion formula as 
/(e 2 ™ k /N)= Y a N (n)e 2nikn/N .] 

\n\<N/2 


e 2iri(k+i)/N^ _j_ j (X2iri(k-l) /N j _ 2 / ( e 2nik / N ) 
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4. Let e be a character on G = Z(iV), the additive group of integers modulo N. 
Show that there exists a unique 0 < £ < N — Iso that 

e(k) = e e {k) = e 2Mk/N for all k 6 Z (N). 

Conversely, every function of this type is a character on Z (N). Deduce that 
ee i— > £ defines an isomorphism from G to G. 

[Hint: Show that e(l) is an N th root of unity.] 

5. Show that all characters on S' 1 are given by 

e n (x) = e 27rinx with n £ Z, 

and check that e n h- > n defines an isomorphism from S’ 1 to Z. 

[Hint: If F is continuous and F(x + y) = F(x)F(y), then F is differentiable. To 
see this, note that if F(0) ^ 0, then for appropriate 5, c = f 0 F(y) dy ^ 0, and 
cF(x) = fj +X F{y)dy. Differentiate to conclude that F( x) = e Ax for some A.} 

6 . Prove that all characters on R take the form 

e^x) = e 271 ^ with (6l, 

and that hh > ^ dehnes an isomorphism from R to R. The argument in Exercise 5 

applies here as well. 

7. Let C = e 2?rl / JV . Dehne the N x N matrix M = (. a,jk)i<j,k<N by a,jk = 


(a) Show that M is unitary. 
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(b) Interpret the identity ( Mu,Mv ) = (u,v) and the fact that M* = Af _1 in 
terms of Fourier series on Z(iV). 


N 

8. Suppose that P(x) = a n e 2lrmx . 

n= 1 

(a) Show by using the Parseval identities for the circle and Z(AT), that 

r 1 i N 

/ \P{x)\ 2 dx = — ^2\P{j/N)\ 2 . 

J o J= i 

(b) Prove the reconstruction formula 

N 

P(x) = Y,PU/N)I<(x-(j/N)) 

J=1 


where 


ix i p2niNx i 

*(*) = -JT 1 - = N^ X + e27Tl2X + ' ' ' + C ^ NX) - 

Observe that P is completely determined by the values P(j/N) for 1 < j < N. 
Note also that A'(0) = 1, and K(j/N ) = 0 whenever j is not congruent to 0 
modulo N. 

9. To prove the following assertions, modify the argument given in the text. 

(a) Show that one can compute the Fourier coefficients of a function on Z(iV) 
when N = 3 n with at most 6 N log 3 N operations. 

(b) Generalize this to N = a n where a is an integer > 1. 

10. A group G is cyclic if there exists g £ G that generates all of G, that is, 
if any element in G can be written as g n for some n e Z. Prove that a finite 
abelian group is cyclic if and only if it is isomorphic to Z(7V) for some N . 

11 . Write down the multiplicative tables for the groups Z*(3), Z*(4), Z*(5), 
Z*(6), Z*(8), and Z*(9). Which of these groups are cyclic? 

12. Suppose that G is a finite abelian group and e : G — ► C is a function that 
satisfies e(x ■ y) = e(x)e(y) for all x,y e G. Prove that either e is identically 0, 
or e never vanishes. In the second case, show that for each x. e(x) = e 2nlr for 
some r £ Q of the form r = p/q, where q = |G|. 
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13. In analogy with ordinary Fourier series, one may interpret finite Fourier 
expansions using convolutions as follows. Suppose G is a finite abelian group, 
1 g its unit, and V the vector space of complex- valued functions on G. 

(a) The convolution of two functions / and g in V is defined for each a £ G 
by 


{f*g)(a)=rLj2f{b)g{a-b x ). 

' ' beG 


Show that for all e € G one has (/ * g)(e) = f(e)g(e). 

(b) Use Theorem 2.5 to show that if e is a character on G, then 

e(c) = 0 whenever c £ G and c ^ 1 a- 

e£(jr 


(c) As a result of (b), show that the Fourier series Sf(a) = X^eeG /(e)e(a) of 
a function / 6 V takes the form 


Sf = f*D, 


where D is defined by 


(4) = E '(') = { 'o' l f t hej£; 

eSG 

Since / * D = /, we recover the fact that Sf = f. Loosely speaking, D 
corresponds to a “Dirac delta function” ; it has unit mass 

1 ceG 

and (4) says that this mass is concentrated at the unit element in G. Thus 
D has the same interpretation as the “limit” of a family of good kernels. 
(See Section 4, Chapter 2.) 

Note. The function D reappears in the next chapter as d-| (n). 


4 Problems 

1. Prove that if n and m are two positive integers that are relatively prime, then 

Z(ron) ~ Z(n) x Z(m). 
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[Hint: Consider the map Z (nm) — * Z(n) x Z(m) given by k t— * (fc mod n, k mod 
m), and use the fact that there exist integers x and y such that xn + ym = 1.] 

2.* Every finite abelian group G is isomorphic to a direct product of cyclic 
groups. Here are two more precise formulations of this theorem. 

• If pi , . . . , p s are the distinct primes appearing in the factorization of the 
order of G, then 

G^GWx-xGfc), 

where each G{p) is of the form G(p) = Z (p ri ) x • • • x Z (p re ), with 0 < 
ri < ■ ■ ■ < re (this sequence of integers depends on p of course). This 
decomposition is unique. 

• There exist unique integers d \ , . . . , such that 

di\d 2 , d 2 \d3, •••, dk-i\dk 


and 


G « Z(di) x • • • x Z(dfc). 


Deduce the second formulation from the first. 

3. Let G denote the collection of distinct characters of the finite abelian group 
G. 

(a) Note that if G = Z (N), then G is isomorphic to G. 

(b) Prove that G\ x G 2 = Gi x G 2 - 

(c) Prove using Problem 2 that if G is a finite abelian group, then G is iso- 
morphic to G. 


4.* When p is prime the group Z *(p) is cyclic, and Z *(p) Ri Z (p — 1). 
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Dirichlet’s Theorem 


Dirichlet, Gustav Lejeune (Diiren 1805-Gottingen 1859), 
German mathematician. He was a number theorist at 
heart. But, while studying in Paris, being a very like- 
able person, he was befriended by Fourier and other 
like-minded mathematicians, and he learned analysis 
from them. Thus equipped, he was able to lay the 
foundation for the application of Fourier analysis to 
(analytic) theory of numbers. 

S. Bochner, 1966 


As a striking application of the theory of finite Fourier series, we now 
prove Dirichlet’s theorem on primes in arithmetic progression. This the- 
orem states that if q and l are positive integers with no common factor, 
then the progression 

£■, £ + q, £ + 2 q, £ + 3 q, ...,£+ kq , . . . 

contains infinitely many prime numbers. This change of subject matter 
that we undertake illustrates the wide applicability of ideas from Fourier 
analysis to various areas outside its seemingly narrower confines. In this 
particular case, it is the theory of Fourier series on the finite abelian 
group Z*(q) that plays a key role in the solution of the problem. 


1 A little elementary number theory 

We begin by introducing the requisite background. This involves elemen- 
tary ideas of divisibility of integers, and in particular properties regarding 
prime numbers. Here the basic fact, called the fundamental theorem of 
arithmetic, is that every integer is the product of primes in an essentially 
unique way. 

1.1 The fundamental theorem of arithmetic 

The following theorem is a mathematical formulation of long division. 
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Theorem 1.1 (Euclid’s algorithm) For any integers a and b with 
b > 0, there exist unique integers q and r with 0 < r < b such that 


a = qb + r. 

Here q denotes the quotient of a by b , and r is the remainder, which 
is smaller than b. 

Proof. First we prove the existence of q and r. Let S denote the set 
of all non-negative integers of the form a — qb with q € Z. This set is 
non-empty and in fact S contains arbitrarily large positive integers since 
b 0. Let r denote the smallest element in S, so that 

r = a — qb 

for some integer q. By construction 0 < r, and we claim that r < b. If 
not, we may write r = b + s with 0 < s < r, so b + s — a — qb, which then 
implies 


s = a — (q + 1 )b. 

Hence s € S with s < r, and this contradicts the minimality of r. 
So r < b, hence q and r satisfy the conditions of the theorem. 

To prove uniqueness, suppose we also had a = q\b + ri where 
0 < 7"i < b. By subtraction we find 


(q - qi)b = n-r. 


The left-hand side has absolute value 0 or >6, while the right-hand side 
has absolute value < b. Hence both sides of the equation must be 0, 
which gives q = qi and r = r\. 

An integer a divides b if there exists another integer c such that 
ac — 6; we then write a\b and say that a is a divisor of b. Note that in 
particular 1 divides every integer, and a|a for all integers a. A prime 
number is a positive integer greater than 1 that has no positive divisors 
besides 1 and itself. The main theorem in this section says that any 
positive integer can be written uniquely as the product of prime numbers. 

The greatest common divisor of two positive integers a and b is the 
largest integer that divides both a and b. We usually denote the greatest 
common divisor by gcd(a, b ). Two positive integers are relatively prime 
if their greatest common divisor is 1. In other words, 1 is the only positive 
divisor common to both a and b. 
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Theorem 1.2 // gcd(a, b) = d, then there exist integers x and y such 
that 

ax + by = d. 

Proof. Consider the set S of all positive integers of the form ax + by 
where x, y 6 Z, and let s be the smallest element in S. We claim that s = 
d. By construction, there exist integers x and y such that 

ax + by = s. 

Clearly, any divisor of a and b divides s, so we must have d < s. The proof 
will be complete if we can show that s|a and s|6. By Euclid’s algorithm, 
we can write a = qs + r with 0 < r < s. Multiplying the above by q we 
find qax + qby — qs, and therefore 

qax + qby = a — r. 

Hence r = a(l — qx) + b(—qy). Since s was minimal in S and 0 < r < s, 
we conclude that r = 0, therefore s divides a. A similar argument shows 
that s divides b, hence s = d as desired. 

In particular we record the following three consequences of the theo- 
rem. 

Corollary 1.3 Two positive integers a and b are relatively prime if and 
only if there exist integers x and y such that ax + by — 1. 

Proof. If a and b are relatively prime, two integers x and y with the 
desired property exist by Theorem 1.2. Conversely, if ax + by = 1 holds 
and d is positive and divides both a and b, then d divides 1 , hence d — 1 . 


Corollary 1.4 If a and c are relatively prime and c divides ab, then c 
divides b. In particular, if p is a prime that does not divide a and p 
divides ab, then p divides b. 

Proof. We can write 1 — ax + cy, so multiplying by b we find b = 
abx + cby. Hence c\b. 

Corollary 1.5 If p is prime and p divides the product a\ - ■ ■ a r , then p 
divides ai for some i. 

Proof. By the previous corollary, if p does not divide a\, then p 
divides a^ ■ ■ ■ a r , so eventually p|oj. 

We can now prove the main result of this section. 
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Theorem 1.6 Every positive integer greater than 1 can be factored 
uniquely into a product of primes. 

Proof. First, we show that such a factorization is possible. We 
do so by proving that the set S of positive integers > 1 which do not 
have a factorization into primes is empty. Arguing by contradiction, we 
assume that S ^ 0. Let n be the smallest element of S. Since n cannot 
be a prime, there exist integers a > 1 and b > 1 such that ab = n. But 
then a < n and b < n, so a fi S as well as 6 ^ S'. Hence both a and b 
have prime factorizations and so does their product n. This implies 
n fi S, therefore S is empty, as desired. 

We now turn our attention to the uniqueness of the factorization. Sup- 
pose that n has two factorizations into primes 


n = pip 2 ■■■Pr 
— <7l<?2 

So pi divides qiq- 2 - ■ ■ q s , and we can apply Corollary 1.5 to conclude that 
Pi\qi for some i. Since qi is prime, we must have p\ = qi- Continuing 
with this argument we find that the two factorizations of n are equal up 
to a permutation of the factors. 

We briefly digress to give an alternate definition of the group Z*(q) 
which appeared in the previous chapter. According to our initial defini- 
tion, Z*(g) is the multiplicative group of units in Z(q): those n € Z (q) 
for which there exists an integer m so that 

(1) nm = 1 mod q. 

Equivalently, Z* (q) is the group under multiplication of all integers in 
Z (q) that are relatively prime to q. Indeed, notice that if (1) is satisfied, 
then automatically n and q are relatively prime. Conversely, suppose 
we assume that n and q are relatively prime. Then, if we put a = n 
and b = q in Corollary 1.3, we find 

nx + qy = 1. 

Hence nx = 1 mod q, and we can take m = x to establish the equiva- 
lence. 

1.2 The infinitude of primes 

The study of prime numbers has always been a central topic in arithmetic, 
and the first fundamental problem that arose was to determine whether 
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there are infinitely many primes or not. This problem was solved in 
Euclid’s Elements with a simple and very elegant argument. 

Theorem 1.7 There are infinitely many primes. 

Proof. Suppose not, and denote by pi, ... ,p n the complete set of 
primes. Define 


N = pip 2 ■■■p n + l. 


Since N is larger than any pi , the integer N cannot be prime. Therefore, 
N is divisible by a prime that belongs to our list. But this is also an 
absurdity since every prime divides the product, yet no prime divides 1. 
This contradiction concludes the proof. 

Euclid’s argument actually can be modified to deduce finer results 
about the infinitude of primes. To see this, consider the following prob- 
lem. Prime numbers (except for 2) can be divided into two classes de- 
pending on whether they are of the form 4fc + 1 or 4k + 3, and the above 
theorem says that at least one of these classes has to be infinite. A natu- 
ral question is to ask whether both classes are infinite, and if not, which 
one is? In the case of primes of the form 4k + 3, the fact that the class is 
infinite has a proof that is similar to Euclid’s, but with a twist. If there 
are only finitely many such primes, enumerate them in increasing order 
omitting 3, 


Pi = 7, p 2 = 11, ..., p n , 


and let 


N = 4pip 2 • • ■ p n T 3 . 

Clearly, N is of the form 4k + 3 and cannot be prime since N > p n . 
Since the product of two numbers of the form 4m + 1 is again of the 
form 4m + 1, one of the prime divisors of N, say p, must be of the form 
4k + 3. We must have p 3, since 3 does not divide the product in the 
definition of N. Also, p cannot be one of the other primes of the form 
4k + 3, that is, p ^ pi for i — 1, . . . n, because then p divides the product 
Pi ■■ - Pn but does not divide 3. 

It remains to determine if the class of primes of the form 4k + 1 is 
infinite. A simple-minded modification of the above argument does not 
work since the product of two numbers of the form 4m + 3 is never of the 
form 4m + 3. More generally, in an attempt to prove the law of quadratic 
reciprocity, Legendre formulated the following statement: 
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If q and £ are relatively prime, then the sequence 
t + kq, k G Z 

contains infinitely many primes (hence at least one prime!). 

Of course, the condition that q and l be relatively prime is necessary, 
for otherwise t + kq is never prime. In other words, this hypothesis says 
that any arithmetic progression that could contain primes necessarily 
contains infinitely many of them. 

Legendre’s assertion was proved by Dirichlet. The key idea in his proof 
is Euler’s analytical approach to prime numbers involving his product 
formula, which gives a strengthened version of Theorem 1.7. This insight 
of Euler led to a deep connection between the theory of primes and 
analysis. 

The zeta function and its Euler product 

We begin with a rapid review of infinite products. If {A n }ff =1 is a se- 
quence of real numbers, we define 

oo N 

n A n = lim TT A n 

N — >-oo 

n= 1 n= 1 

if the limit exists, in which case we say that the product converges. The 
natural approach is to take logarithms and transform products into sums. 
We gather in a lemma the properties we shall need of the function log x, 
defined for positive real numbers. 

Lemma 1.8 The exponential and logarithm functions satisfy the follow- 
ing properties: 

(i) e logx - x. 

(ii) log(l + x) = x + E(x) where \E{x)\ < x 2 if |x| < 1/2. 

(iii) //log(l + x) = y and |x| < 1/2, then \y\ < 2\x\. 

In terms of the O notation, property (ii) will be recorded as 
log(l + x) — x + 0(x 2 ). 

Proof. Property (i) is standard. To prove property (ii) we use the 
power series expansion of log(l + x) for |x| < 1, that is, 


Ibookroot October 20, 2007 


1. A little elementary number theory 247 

Then we have 

t 2 3 4 

E(x) = log(l + x)-x = - Y + --- + - , 
and the triangle inequality implies 

\E{x)\ < y (1 + |x| + |x| 2 + •■■)• 

Therefore, if |x| <1/2 we can sum the geometric series on the right-hand 
side to find that 


\E(x)\ < 


“ 2 
<x 2 . 


1 1 
2 + 2 ^ + 


1 


1 - 1/2 


The proof of property (iii) is now immediate; if x ^ 0 and |x| < 1/2, then 


log(l + x) 

<1 + 

E(x) 

X 


X 


< 1 + \x 
< 2 , 


and if x — 0, (iii) is clearly also true. 

We can now prove the main result on infinite products of real numbers. 

Proposition 1.9 If A n — 1 + a n and Y2 |a„] converges, then the prod- 
uct ]/[n A n converges, and this product vanishes if and only if one of 
its factors A n vanishes. Also, if a n ^ 1 for all n, then JJ n 1/(1 — a n ) 
converges. 

Proof. If Y2 \ a n\ converges, then for all large n we must have |a n | < 
1/2. Disregarding finitely many terms if necessary, we may assume that 
this inequality holds for all n. Then we may write the partial products 
as follows: 

N N 

— glog(l+a n ) _ gSjv^ 
n= 1 n=l 

where B jy = Y2n= l ^ n with b n — log(l + a n ). By the lemma, we know 
that b n | < 2|a n |, so that B ^ converges to a real number, say B. Since 
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the exponential function is continuous, we conclude that e BN converges 
to e B as N goes to infinity, proving the first assertion of the proposition. 
Observe also that if 1 + a n ^ 0 for all n, the product converges to a 
non-zero limit since it is expressed as e B . 

Finally observe that the partial products of — a n ) are 

l/rin=l(l — a n ), so the same argument as above proves that the product 
in the denominator converges to a non- zero limit. 

With these preliminaries behind us, we can now return to the heart of 
the matter. For s a real number (strictly) greater than 1, we define the 

zeta function by 


«*> - £ if- 


n = 1 


To see that the series defining £ converges, we use the principle that 
whenever / is a decreasing function one can compare /( n ) with 
/ f(x) dx, as is suggested by Figure 1. Note also that a similar tech- 
nique was used in Chapter 3, that time bounding a sum from below by 
an integral. 



Figure 1 . Comparing sums with integrals 


Here we take f(x) — l/x s to see that 


OO oo 

E ^ - 1 + ^J lX 

n= I n n=2 Jn ~ lX 


dx dx 

- = 1 + / a?’ 
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and therefore, 

(3) COO < 1 + s _ £< 

Clearly, the series dehning £ converges uniformly on each half-line 
s > sq > 1, hence is continuous when s > 1. The zeta function was 
already mentioned earlier in the discussion of the Poisson summation 
formula and the theta function. 

The key result is Euler’s product formula. 

Theorem 1.10 For every s > 1, we have 

«»> = nrvV 

where the product is taken over all primes. 

It is important to remark that this identity is an analytic expression 
of the fundamental theorem of arithmetic. In fact, each factor of the 
product 1/(1 — p~ s ) can be written as a convergent geometric series 

1 1 1 
1 ^2s + ' ' ' + pMs + ' ' ’ ' 

So we consider 



where the product is taken over all primes, which we order in increasing 
order pi < P 2 < ■ ■ ■ ■ Proceeding formally (these manipulations will be 
justified below), we calculate the product as a sum of terms, each term 
originating by picking out a term 1 jp k - s (in the sum corresponding to 
Pj ) with a k, which of course will depend on j, and with k = 0 for j 
sufficiently large. The product obtained this way is 

1 _ 1 

{.P k iP k 2 2 -'-p k ™) s nS ' 

where the integer n is written as a product of primes n — p^p^ 2 • • • • 

By the fundamental theorem of arithmetic, each integer > 1 occurs in 
this way uniquely, hence the product equals 
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We now justify this heuristic argument. 

Proof. Suppose M and N are positive integers with M > N. Observe 
now that any positive integer n < N can be written uniquely as a product 
of primes, and that each prime must be less than or equal to N and 
repeated less than M times. Therefore 



Letting N tend to infinity now yields 




For the reverse inequality, we argue as follows. Again, by the fundamen- 
tal theorem of arithmetic, we find that 



Letting M tend to infinity gives 


Hence 



and the proof of the product formula is complete. 

We now come to Euler’s version of Theorem 1.7, which inspired Dirich- 
let’s approach to the general problem of primes in arithmetic progression. 
The point is the following proposition. 
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Proposition 1.11 The series 


E 1 /^ 

p 

diverges, when the sum is taken over all primes p. 

Of course, if there were only finitely many primes the series would 
converge automatically. 

Proof. We take logarithms of both sides of the Euler formula. Since 
log a; is continuous, we may write the logarithm of the infinite product 
as the sum of the logarithms. Therefore, we obtain for s > 1 

“E 1 °S( 1 - 1 /P S ) = lo gC(s)- 

V 

Since log(l + x) = x + 0(\x\ 2 ) whenever \x\ < 1/2, we get 

-E [~ l /p s + °( l /p 28 )\ =1 °gC(s), 

v 

which gives 


Ew + °(i) = 1 °gc(s). 

p 

The term 0(1) appears because J2 P 1/ P 2s < E^°=i l/^ 2 - Now we let s 
tend to 1 from above, namely s — > 1 + , and note that f(s) — > oo since 
E^°= 1 1 / nS > Ef=i l/ nS ! and therefore 

lim inf 

S— >1+ 

We conclude that J2 P 
s > 1, we finally have 

EV:P = oo. 
v 


M 


E i/ riS — E V n f° r ever y 


n= 1 


n= 1 


l/p s — > oo as s — > 1+, and since 1 jp> l/p s for all 
that 


In the rest of this chapter we see how Dirichlet adapted Euler’s insight. 
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2 Dirichlet’s theorem 

We remind the reader of our goal: 

Theorem 2.1 If q and I are relatively prime positive integers, then there 
are infinitely many primes of the form I + kq with k £ Z. 

Following Euler’s argument, Dirichlet proved this theorem by showing 
that the series 



diverges, where the sum is over all primes congruent to I modulo q. Once 
q is fixed and no confusion is possible, we write p = I to denote a prime 
congruent to l modulo q. The proof consists of several steps, one of 
which requires Fourier analysis on the group Z*(q). Before proceeding 
with the theorem in its complete generality, we outline the solution to 
the particular problem raised earlier: are there infinitely many primes of 
the form 4 k + 1? This example, which consists of the special case q — 4 
and l — 1, illustrates all the important steps in the proof of Dirichlet’s 
theorem. 

We begin with the character on Z*(4) defined by x(l) = 1 and 
x(3) = — 1. We extend this character to all of Z as follows: 



Note that this function is multiplicative, that is, x(tjto) — x( n )x( m ) on 
all of Z. Let L(s,x ) = x( n )/ nS , so that 


T , . . 1 1 1 

L (S, J+'"* 


Then L( l, x) is the convergent series given by 



Since the terms in the series are alternating and their absolute values 
decrease to zero we have L(l,x) 0. Because x is multiplicative, the 
Euler product generalizes (as we will prove later) to give 



OO 


i - x(p)/p s ' 
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Taking the logarithm of both sides, we find that 

logL( S , X ) = E^¥ + °( 1 )- 

v P 


Letting s — > 1 + , the observation that L(l, x) 7^ 0 shows that J2 P x(p)/p s 
remains bounded. Hence 


E 


1 

pS 


E 


1 

pS 


is bounded as s — > 1 + . However, we know from Proposition 1.11 that 


E 


1 

pS 


is unbounded as s — > 1 + , so putting these two facts together, we find 
that 


^E 


1 

pS 


is unbounded as s — > 1 + . Hence 1 /p diverges, and as a consequence 
there are infinitely many primes of the form 4 k + 1. 


We digress briefly to show that in fact L(l,x)=vr/4. To see this, we 
integrate the identity 


1 


and get 


1 + x 2 

ry dx 


= 1 - x 2 + x 4 - x b + 


= y 


y_ 

3 


5 


0 < y < 1. 


Jo 1 + x 2 

We then let y tend to 1. The integral can be calculated as 

f 1 dx 1 7T 

/ ; o ~ arctanu L = — , 

Jo 1+x 2 10 4’ 


so this proves that the series 1 — 1/3 +1/5 — is Abel summable to 
7t/4. Since we know the series converges, its limit is the same as its Abel 
limit, hence 1 — 1/3 + 1/5 — ■ ■ ■ = 7r/4. 


The rest of this chapter gives the full proof of Dirichlet’s theorem. We 
begin with the Fourier analysis (which is actually the last step in the 
example given above), and reduce the theorem to the non- vanishing of 
L-functions. 
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2.1 Fourier analysis, Dirichlet characters, and reduction of the 
theorem 

In what follows we take the abelian group G to be Z*(g). Our formulas 
below involve the order of G , which is the number of integers 0 < n < 
q that are relatively prime to g; this number defines the Euler phi- 
function <p(q), and |G| = tp(q). 

Consider the function S( on G, which we think of as the characteristic 
function of t\ if n E Z*(g), then 



1 if n = i mod q, 
0 otherwise. 


We can expand this function in a Fourier series as follows: 





where 



Hence 



We can extend the function 5? to all of Z by setting 8i{m) = 0 whenever m 
and q are not relatively prime. Similarly, the extensions of the characters 


e £ G to all of Z which are given by the recipe 

e(m) if m and q are relatively prime 
0 otherwise, 



are called the Dirichlet characters modulo q. We shall denote the 
extension to Z of the trivial character of G by XO; so that Xo( m ) — 1 if 
m and q are relatively prime, and 0 otherwise. Note that the Dirichlet 
characters modulo q are multiplicative on all of Z, in the sense that 


x(nm ) = x( n )x( m ) for all n, m € Z. 


Since the integer q is fixed, we may without fear of confusion, speak of 
“Dirichlet characters” omitting reference to q. 1 

With |G| = ip(q), we may restate the above results as follows: 


1 We use the notation \ instead of e to distinguish the Dirichlet characters (defined on 
Z) from the characters e (defined on Z *(<?)). 
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Lemma 2.2 The Dirichlet characters arc multiplicative. Moreover, 



where the sum is over all Dirichlet characters. 

With the above lemma we have taken our first step towards a proof of 
the theorem, since this lemma shows that 



Thus it suffices to understand the behavior of J2 p x(p)'P s as s — > 1 + . In 


fact, we divide the above sum in two parts depending on whether or not 
X is trivial. So we have 



Since there are only finitely many primes dividing q , Euler’s theorem 
(Proposition 1.11) implies that the first sum on the right-hand side di- 
verges when s tends to 1. These observations show that Dirichlet’s the- 
orem is a consequence of the following assertion. 

Theorem 2.3 If x is a nontrivial Dirichlet character, then the sum 



p 


remains bounded as s — > 1 + . 


The proof of Theorem 2.3 requires the introduction of the L- functions, 
to which we now turn. 

2.2 Dirichlet T-functions 

We proved earlier that the zeta function (( s ) = V nS could be ex- 
pressed as a product, namely 
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Dirichlet observed an analogue of this formula for the so-called L- functions 
defined for s > 1 by 


L(s,x) 


y X(n) 

2.^1 n s 
71—1 


where % is a Dirichlet character. 
Theorem 2.4 If s > 1, then 


y XW 

“ n s 

71 = 1 


n 


i 

(i - x(p)p~ s Y 


where the product is over all primes. 

Assuming this theorem for now, we can follow Euler’s argument for- 
mally: taking the logarithm of the product and using the fact that 
log(l + x) — x + 0(x 2 ) whenever x is small, we would get 


log L(s, x) = - E 1 °S( 1 “ X(P)/P S ) 


= -E 


x(p) 


p 


p a 


+ 0 


p- 


P ** 


If L(l,x) is finite and non-zero, then log L(s,x) is bounded as s — > 1 + , 
and we can conclude that the sum 

y- X(p) 

2-^ ps 

p r 


is bounded as s — > 1 + . We now make several observations about the 
above formal argument. 

First, we must prove the product formula in Theorem 2.4. Since the 
Dirichlet characters x can b e complex-valued we will extend the loga- 
rithm to complex numbers w of the form w — 1/(1 — z) with \z\ < 1. 
(This will be done in terms of a power series.) Then we show that with 
this definition of the logarithm, the proof of Euler’s product formula 
given earlier carries over to L-functions. 

Second, we must make sense of taking the logarithm of both sides of the 
product formula. If the Dirichlet characters are real, this argument works 
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and is precisely the one given in the example corresponding to primes 
of the form 4fc + 1. In general, the difficulty lies in the fact that x(p) is 
a complex number, and the complex logarithm is not single valued; in 
particular, the logarithm of a product is not the sum of the logarithms. 

Third, it remains to prove that whenever x 7^ Xo> then log L(s,x) is 
bounded as s — > 1 + . If (as we shall see) L(s,x) is continuous at s = 1 , 
then it suffices to show that 


£( 1 ) x) 7 ^ 0 . 

This is the non- vanishing we mentioned earlier, which corresponds to the 
alternating series being non-zero in the previous example. The fact that 
L(l, x) 7^ 0 is the most difficult part of the argument. 

So we will focus on three points: 

1. Complex logarithms and infinite products. 

2. Study of L(s,x)- 

3. Proof that L(l,%) ^ 0 if X is non-trivial. 

However, before we enter further into the details, we pause briefly to 
discuss some historical facts surrounding Dirichlet’s theorem. 

Historical digression 

In the following list, we have gathered the names of those mathematicians 
whose work dealt most closely with the series of achievements related to 
Dirichlet’s theorem. To give a better perspective, we attach the years in 
which they reached the age of 35: 

Euler 1742 
Legendre 1787 
Gauss 1812 
Dirichlet 1840 
Riemann 1861 

As we mentioned earlier, Euler’s discovery of the product formula for 
the zeta function is the starting point in Dirichlet’s argument. Legendre 
in effect conjectured the theorem because he needed it in his proof of the 
law of quadratic reciprocity. However, this goal was first accomplished 
by Gauss who, while not knowing how to establish the theorem about 
primes in arithmetic progression, nevertheless found a number of different 
proofs of quadratic reciprocity. Later, Riemann extended the study of 
the zeta function to the complex plane and indicated how properties 
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related to the non-vanishing of that function were central in the further 
understanding of the distribution of prime numbers. 

Dirichlet proved his theorem in 1837. It should be noted that Fourier, 
who had befriended Dirichlet when the latter was a young mathematician 
visiting Paris, had died several years before. Besides the great activity in 
mathematics, that period was also a very fertile time in the arts, and in 
particular music. The era of Beethoven had ended only ten years earlier, 
and Schumann was now reaching the heights of his creativity. But the 
musician whose career was closest to Dirichlet was Felix Mendelssohn 
(four years his junior). It so happens that the latter began composing 
his famous violin concerto the year after Dirichlet succeeded in proving 
his theorem. 

3 Proof of the theorem 

We return to the proof of Dirichlet ’s theorem and to the three difficulties 
mentioned above. 

3.1 Logarithms 

The device to deal with the first point is to define two logarithms, one 
for complex numbers of the form 1/(1 — z) with \z\ < 1 which we denote 
by logj , and one for the function L(s,x) which we will denote by log 2 . 

For the first logarithm, we define 



for \z\ < 1. 


Note that \og 1 w is then defined if Re(«;) > 1/2, and because of equa- 
tion (2), log x w gives an extension of the usual log a; when x is a real 
number >1/2. 

Proposition 3.1 The logarithm function logj satisfies the following prop- 
erties: 

(i) If \z\ < 1, then 



1 - 2 


(ii) If W < 1, then 



where the error E\ satisfies \Ei(z)\ < |ar | 2 if \z\ < 1/2. 
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(iii) If \z\ < 1/2, then 


1 


logi 


1 - z 


< 2\z\. 


Proof. To establish the hrst property, let z = re* 0 with 0 < r < 1, 
and observe that it suffices to show that 



( 5 ) 


To do so, we differentiate the left-hand side with respect to r, and this 
gives 



The term in brackets equals 




e ie + (1 - re ie )e ie ^(re* 0 )*" 1 


Having found that the left-hand side of the equation (5) is constant, we 
set r = 0 and get the desired result. 

The proofs of the second and third properties are the same as their 
real counterparts given in Lemma 1.8. 

Using these results we can state a sufficient condition guaranteeing 
the convergence of infinite products of complex numbers. Its proof is the 
same as in the real case, except that we now use the logarithm log^ 

Proposition 3.2 If \a n \ converges, and a n ^ 1 for all n, then 



converges. Moreover, this product is non-zero. 

Proof. For n large enough, \a n \ < 1/2, so we may assume without 
loss of generality that this inequality holds for all n > 1. Then 
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But we know from the previous proposition that 

1 


logi 


1 - 2 


< 2\z\, 


so the fact that the series \a n \ converges, immediately implies that the 
limit 


N f 1 

lim V log, I - — 

n — >oo \1 — a r 

n= 1 ' 


= A 


exists. Since the exponential function is continuous, we conclude that 
the product converges to e A , which is clearly non- zero. 


We may now prove the promised Dirichlet product formula 


Y' xH 



n 


n 


i 

(1 - x{p)p~ s )' 


For simplicity of notation, let L denote the left-hand side of the above 
equation. Define 

= E and U N - [] ( * , ) • 

n<N p<N V X\PJP / 

The infinite product If = linijv^oo II /v = lip ( i- x (p)p-» ) converges by 
the previous proposition. Indeed, if we set a n — x(Pn)Pn S > where p n is 
the n th prime, we note that if s > 1, then ^ \a n \ < oo. 

Also, define 


lljV.M 


n 

p<N 



x(p) 

pS 


H + 


x(p M ) \ 
pMs J 


Now fix e > 0 and choose N so large that 


\Sn — L | < e and |IIjv — H| < e. 


We can next select M large enough so that 

\Sn — II n,m | < e and |Hjv,M — IIjv| < e. 

To see the first inequality, one uses the fundamental theorem of arith- 
metic and the fact that the Dirichlet characters are multiplicative. The 
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second inequality follows merely because each series Y^=i con- 
verges. 

Therefore 


| L — n| < | L — S'jvl + |*5 at — IIjv,m| + — Ilwl + in^v — n| < 4e, 

as was to be shown. 

3.2 L-functions 

The next step is a better understanding of the L-functions. Their behav- 
ior as functions of s (especially near s = 1) depends on whether or not x 
is trivial. In the first case, L(s,x o) is up to some simple factors just the 
zeta function. 

Proposition 3.3 Suppose \o is the trivial Dirichlet character, 

, . _ J 1 if n and q are relatively prime, 

Xo( n ) | q otherwise, 

and q — pf 1 ■ ■ is the prime factorization of q. Then 

L(s,xo) = (1 ~ Pi s ){l - P 2 S ) ' ' ' (1 ~Pn S )C( s )- 

Therefore L(s, xo) °° as s — t 1 + . 

Proof. The identity follows at once on comparing the Dirichlet and 
Euler product formulas. The final statement holds because £(s) — » oo as 
s -> 1+. 

The behavior of the remaining L- functions, those for which x¥" Xo> 
is more subtle. A remarkable property is that these functions are now 
defined and continuous for s > 0. In fact, more is true. 

Proposition 3.4 Ifx is a non-trivial Dirichlet character, then the series 

OO 

X(n)/n s 

n = 1 

converges for s > 0, and we denote its sum by L(s,x)- Moreover: 

(i) The function L{s,x) is continuously differentiable for 0 < s < oo. 

(ii) There exists constants c,c' >0 so that 

L(s ,x) — 1 + 0(e~ cs ) as s — > oo, and 

L’{s,x) = 0(e~ c ' s ) ass 


OO. 
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We first isolate the key cancellation property that non-trivial Dirichlet 
characters possess, which accounts for the behavior of the L-function 
described in the proposition. 


Lemma 3.5 If x a non-trivial Dirichlet character, then 


Y *( n ) 

71—1 


< q, 


for any k. 


Proof. First, we recall that 


9 

*(") = °- 

71=1 


In fact, if S denotes the sum and a € Z*(g), then the multiplicative 
property of the Dirichlet character x gives 

x(a)S = Y x{a)x(n) = Y x{an) = Y x{n) = S. 

Since x is non-trivial, x( a ) 7^ 1 f° r some a, hence 5 = 0. We now write 
k = aq + b with 0 < b < q, and note that 

k aq 

Y = + x(n)= Y 

n=l 71=1 aq<n<aq-\-b aq<n<aq-\-b 


and there are no more than q terms in the last sum. The proof is complete 
once we recall that |x(n)| < 1. 

We can now prove the proposition. Let s^, = X^=i x( n )> and s o — 0. 
We know that L(s, x) is defined for s > 1 by the series 

y' X(n) 

, n s 

71=1 

which converges absolutely and uniformly for s > 6 > 1. Moreover, the 
differentiated series also converges absolutely and uniformly for s > S > 
1 , which shows that L(s,x ) is continuously differentiable for s > 1 . We 
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sum by parts 2 to extend this result to s > 0. Indeed, we have 


N 


E 


x{k) 

k s 


N 


E 

k = 1 

N - 1 

E Sk 


Sk ^k — 1 

k s 


k= 1 

N-l 


1 


(k + iy 


E ^ fe ( s ) + AbT’ 


sn_ 
+ N s 


k= 1 


77 s 


where //-(s) = S& [A; -8 — (k + 1)~ 8 ]. If g(x) = x -8 , then g'(x) = — sx -8 ' 1 , 
so applying the mean- value theorem between x — k and x = k + 1 , and 
the fact that |s*. | < q, we find that 


\fk(s)\ <qsk s 1 . 


Therefore, the series fk(s) converges absolutely and uniformly for s > 
S > 0, and this proves that L(s, x) is continuous for s > 0. To prove that 
it is also continuously differentiable, we differentiate the series term by 
term, obtaining 

Again, we rewrite this series using summation by parts as 

E Sk [ ~ k ~ S lo g k+ (k + l) _s log(fc + 1)] , 

and an application of the mean-value theorem to the function g(x) — 
x _s logx shows that the terms are 0(k~ s / 2 ~ 1 ), thus proving that the 
differentiated series converges uniformly for s > 5 > 0. Hence L(s,x) is 
continuously differentiable for s > 0. 

Now, observe that for all s large, 

OO 

I L(s,x) - 1| < 2 qJ2 n ~ S 

71 = 2 

< 2 _s O(l), 


and we can take c = log 2, to see that L(s, x) = 1 + 0(e~ cs ) as s — > oo. 
A similar argument also shows that L'(s,x ) — 0(e~ c s ) as s — > oo with 
in fact c' = c, and the proof of the proposition is complete. 


2 For the formula of summation by parts, see Exercise 7 in Chapter 2. 
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With the facts gathered so far about L(s, x) we are in a position to 
define the logarithm of the L-functions. This is done by integrating its 
logarithmic derivative. In other words, if x is a non-trivial Dirichlet 
character and s > 1 we define 3 


We know that L(t, x) ^ 0 for every t > 1 since it is given by a product 
(Proposition 3.2), and the integral is convergent because 


L'{t,x) 
L{t, x) 


0(e~ ct ), 


which follows from the behavior at infinity of L(t, x) and L'(t, x) recorded 
earlier. 

The following links the two logarithms. 


Proposition 3.6 If s > 1, then 

e log s l(.,x) = i(j , x) . 

Moreover 

l ° g 2 L(s, x) — ^ logi (j— - ~ { 


- x(p)/p i 


Proof. Differentiating e log2 L ^’ x ^T(s, y) with respect to s gives 


Lfs,x) 
L(s, x) 


e ~ log2 i(s ’ x) L(s, x) + e“ log2 i(s ’ x) T'(s, x) = 0. 


So e ~ log2 l ( s ’ x ^L(s, x) is constant, and this constant can be seen to be 1 
by letting s tend to infinity. This proves the first conclusion. 

To prove the equality between the logarithms, we fix s and take the ex- 
ponential of both sides. The left-hand side becomes e log2 L ( s > x ) = L(s, x), 
and the right-hand side becomes 


gS P * ogi ( i-\(p)/p s ) 


e' ogl ( !-x(p)/p s ) = 

p 



1 

i - x(p)/p 


L(s,x), 


3 The notation log 2 used in this context should not be confused with the logarithm to 
the base 2. 
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by (i) in Proposition 3.1 and the Dirichlet product formula. Therefore, 
for each s there exists an integer M(s) so that 



As the reader may verify, the left-hand side is continuous in s, and this 
implies the continuity of the function M{s). But M{s) is integer- valued 
so we conclude that M(s) is constant, and this constant can be seen to 
be 0 by letting s go to inhnity. 

Putting together the work we have done so far gives rigorous meaning 
to the formal argument presented earlier. Indeed, the properties of log! 
show that 



Now if L(l,x) 7 ^ 0 for a non-trivial Dirichlet character, then by its in- 
tegral representation log 2 L(s,x) remains bounded as s —> 1 + . Thus 
the identity between the logarithms implies that J2 p x(p)p~ s remains 
bounded as s — > 1 + , which is the desired result. Therefore, to finish the 
proof of Dirichlet’s theorem, we need to see that L( 1, x) ^ 0 when % is 
non-trivial. 

3.3 Non-vanishing of the L-function 

We now turn to a proof of the following deep result: 

Theorem 3.7 Ifx^ Xo, then L( l,x) + 0. 

There are several proofs of this fact, some involving algebraic number 
theory (among them Dirichlet’s original argument), and others involving 
complex analysis. Here we opt for a more elementary argument that 
requires no special knowledge of either of these areas. The proof splits 
in two cases, depending on whether % is complex or real. A Dirich- 
let character is said to be real if it takes on only real values (that is, 
+ 1, —1, or 0) and complex otherwise. In other words, x is real if and 
only if x( n ) — x( n ) f° r all integers n. 
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Case I: complex Dirichlet characters 

This is the easier of the two cases. The proof is by contradiction, and we 
use two lemmas. 

Lemma 3.8 If s > 1, then 

n L ( s ’ X ) ^ L 

x 

where the product is taken over all Dirichlet characters. In particular the 
product is real-valued. 

Proof. We have shown earlier that for s > 1 

L(,,X> = e*r(po gl ( i _i^)). 

Hence, 


Y[ l ( S , x ) = exp (e E lo gi ( 

/ oo 

= «P EEE 


X P k = 1 

OO 


= exp EEE 


p k= i x 


- x(p)p~ 
i x(p k ) \ 

k p ks J 

i x(p k ) \ 

k p ks ) 


Because of Lemma 2.2 (with I = 1) we have x(p k ) — p(q)di(p k )i and 
hence 


p k= 1 


k p ks ) 


since the term in the exponential is non-negative. 


Lemma 3.9 The following three properties hold: 

(i) V L( l,x) = 0, then L(l,x) = 0- 

(ii) If x i s non-trivial and L{ 1,%) = 0, then 

|i(s,x)| < C7|s — l] whenl<s<2. 
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(iii) For the trivial Dirichlet character \o > we have 

(j 

|L(s,Xo)| < i r when 1 < s < 2. 

s — 1 


Proof. The first statement is immediate because L(l,x) — L(l,x)- 
The second statement follows from the mean-value theorem since L(s, x) 
is continuously differentiable for s > 0 when x is non-trivial. Finally, the 
last statement follows because by Proposition 3.3 

L{s,xo) = (1 ~ Pf s ){^ ~ P 2 S ) ■ ■ ■ i 1 ~ Pn S )C(s), 

and £ satisfies the similar estimate (3). 

We can now conclude the proof that L( l,x) 7 ^ 0 for % a non-trivial 
complex Dirichlet character. If not, say L(l,x) = 0, then we also have 
L(l, x) — 0- Since x 7^ X> there are at least two terms in the product 

x 

that vanish like |s — 1| ass— >1 + . Since only the trivial character con- 
tributes a term that grows, and this growth is no worse than 0(l/|s — 1|), 
we find that the product goes to 0 as s — > 1 + , contradicting the fact that 
it is > 1 by Lemma 3.8. 

Case II: real Dirichlet characters 

The proof that L( 1, %) ^ 0 when y is a non-trivial real Dirichlet character 
is very different from the earlier complex case. The method we shall 
exploit involves summation along hyperbolas. It is a curious fact that 
this method was introduced by Dirichlet himself, twelve years after the 
proof of his theorem on arithmetic progressions, to establish another 
famous result of his: the average order of the divisor function. However, 
he made no connection between the proofs of these two theorems. We will 
instead proceed by proving first Dirichlet’s divisor theorem, as a simple 
example of the method of summation along hyperbolas. Then, we shall 
adapt these ideas to prove the fact that L( 1,%) ^ 0. As a preliminary 
matter, we need to deal with some simple sums, and their corresponding 
integral analogues. 

Sums vs. Integrals 

Here we use the idea of comparing a sum with its corresponding integral, 
which already occurred in the estimate (3) for the zeta function. 
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Proposition 3.10 If N 


is a positive integer, then: 


(i) 

(ii) 


1 C N fhr 

E -= / — + 0(1) -log N + 0(1). 

l<n<N n Jl x 

More precisely, there exists a real number 7 , called Euler’s constant, 
so that 


E - = logJV + 7 + 0(l/iV). 

l<n< AT n 


Proof. It suffices to establish the more refined estimate given in 
part (ii). Let 




dx 

x 


Since 1 /x is decreasing, we clearly have 


0 < 7n < - 
n 


1 1 

—7 T < ~ 2 > 
n + 1 n z 


so the series 7n converges to a limit which we denote by 7. More- 

over, if we estimate f(n) by / /(x) dx, where f(x) — 1 /x 2 , we find 


OO OO /»00 7 

E i.< E i</ % = oam- 

n^+1 n=N+i n Jn X 


Therefore 


1 /-JV 00 

E - / - = 7- E 

• ^ m I * nr < ^ 


n = 1 


n Ji x 


n=N-\-l 


r N+1 dx 
In x ’ 


and this last integral is 0(1/N) as N — > 00 . 


Proposition 3.11 If N is a positive integer, then 


E 

l<n<JV 


1 

n V2 


f 7T + C' + 0(1/M7 

2A1 1 ' /2 + c+0(1/1V 1/2 ). 


The proof is essentially a repetition of the proof of the previous proposi- 
tion, this time using the fact that 


1 


1 

(n + l ) 1 / 2 


C 

< T~. 

~ n 3 / 2 
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This last inequality follows from the mean-value theorem applied to 
f(x) = X -1 / 2 , between x = n and x — n + 1. 

Hyperbolic sums 

If F is a function defined on pairs of positive integers, there are three 
ways to calculate 

S N = ^2^2F(m,n), 

where the sum is taken over all pairs of positive integers (m, n) which 
satisfy mn < N . 

We may carry out the summation in any one of the following three 
ways. (See Figure 2.) 

(a) Along hyperbolas: 


Sn= Yl ( H F(m,n) 


l<k<N \nm=k 


(b) Vertically: 


Sn= 51 F ( m ’ n ) 

l<m<N \l<n<N/m 


(c) Horizontally: 


Sn — 


E ( 


F(m,n ) 

l<n<N \l<m<N/n 


It is a remarkable fact that one can obtain interesting conclusions from 
the obvious fact that these three methods of summation give the same 
sum. We apply this idea first in the study of the divisor problem. 

Intermezzo: the divisor problem 

For a positive integer k. let d(k) denote the number of positive divisors 
of k. For example, 


k 

i 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

d(k) 

i 

2 

2 

3 

2 

4 

2 

4 

3 

4 

2 

6 

2 

4 

4 

5 

2 
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Figure 2. The three methods of summation 


One observes that the behavior of d(k) as k tends to infinity is rather 
irregular, and in fact, it does not seem possible to approximate d(k) by 
a simple analytic expression in k. However, it is natural to inquire about 
the average size of d(k). In other words, one might ask, what is the 
behavior of 

1 N 

— Y d(k) as N — > oo? 

N h 

The answer was provided by Dirichlet, who made use of hyperbolic sums. 
Indeed, we observe that 

d(k) y i. 

nm=k , l<n,m 

Theorem 3.12 If k is a positive integer, then 

1 N 

— yd{k) = log n + o{i). 

v k= 1 

More precisely, 

1 N 

- y d(k) = log N + (2 7 - 1) + 0(1/IV 1 / 2 ), 

V k= 1 

where 7 is Euler’s constant. 

Proof. Let S jv = Ylk= \d(k). We observed that summing F — 1 along 
hyperbolas gives Sjy. Summing vertically, we find 

s n= y y !■ 

l<m<N l<n<N / m 
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But Y^i<n<N/m 1 — [N/m] = N/m + 0(1), where [x] denote the greatest 
integer < x. Therefore 


S N = (N/m + 0(1)) 

l<m<N 



+ 0(N). 


Hence, by part (i) of Proposition 3.10, 


Sn 

~w 


log IV + 0(1) 


which gives the first conclusion. 

For the more refined estimate we proceed as follows. Consider the 
three regions I, II, and III shown in Figure 3. These are defined by 


I = {1 < m < N l N , N 1 / 2 < n < N/m}, 
II = {1 < m < iV 1/2 , 1 < n < A^ 1/2 }, 

HI = {AT 1 / 2 < m < N/n, 1 <n< N 1 / 2 }. 



If Si, Sn, and Sm denote the sums taken over the regions I, II, and 
III, respectively, then 


Sn — Si + Sn + Sm 
- 2 (Si + Sn) ~ S H , 
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since by symmetry Si = Sni- Now we sum vertically, and use (ii) of 
Proposition 3.10 to obtain 

s, + s„= y. [ E i) 

= Y wh 

lKmKN 1 / 2 

= Y Wm + 0{ 1 )) 

1 <m<iW2 

= AM Y 1/m j +0(iV 1/2 ) 

XlKmKN 1 / 2 J 
= N log N 1 / 2 + IVy + 0(7V 1 / 2 ). 

Finally, Su corresponds to a square so 

S n = Y 1 — [N 1 / 2 ] 2 — N + 0(N 1 ^ 2 ). 

l<m<I>/ 2 lKnKN 1 / 2 

Putting these estimates together and dividing by N yields the more re- 
fined statement in the theorem. 


Non-vanishing of the L-function 

Our essential application of summation along hyperbolas is to the main 
point of this section, namely that L( 1, y) 7^ 0 for a non-trivial real Dirich- 
let character y. 

Given such a character, let 


F(m, n) 


xip) 

( nm ) 1 / 2 ’ 


and define 

s N = Y'52 F (. m i n )’ 

where the sum is over all integers m, n > 1 that satisfy mn < N . 
Proposition 3.13 The following statements are true: 

(i) Sn > clog TV for some constant c > 0. 

(ii) S n = 2N 1 / 2 L(1, x ) + 0(1). 
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It suffices to prove the proposition, since the assumption L(l,x) — 0 
would give an immediate contradiction. 

We first sum along hyperbolas. Observe that 


y- X(n) 
^ , ( nm ) 1 / 2 

nm=k v 7 


l 

F72 




For conclusion (i) it will be enough to show the following lemma. 
Lemma 3.14 V x („) > { J ^ g z 

n\k 

From the lemma, we then get 

S N> Y JJJ 2 — C ^°S N , 
k=P, IKN 1 / 2 


where the last inequality follows from (i) in Proposition 3.10. 

The proof of the lemma is simple. If k is a power of a prime, say 
k = p a , then the divisors of k are 1 ,p,p 2 , . . . ,p a and 

Y x( n ) = x{i) + x(p) + x(p 2 ) + ■ ■ ■ + x(p a ) 

n\k 

= 1 + x(p) + x{p) 2 + • • • + x(p) a - 


So this sum is equal to 

( a + 1 if x(:p) = 1, 

I 1 if x(p) — ~ 1 and a is even, 

I 0 if x(p) = — 1 and a is odd, 

l. 1 if x(p) = 0, that is p\q. 

In general, if k = p^ 1 ■ ■ ■ p ^ , then any divisor of k is of the form p J 1 ■ ■ ■ p ^ 
where 0 < bj < aj for all j . Therefore, the multiplicative property of x 
gives 

N 

Y *( n ) = n (x(!) + x{Pj) + x(p 2 ) + • • • + \U'"i ! )) » 

n\k j = 1 

and the proof is complete. 

To prove the second statement in the proposition, we write 

Sn — Si + (S n + Sin), 
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where the sums Si, Sn, and Sni were defined earlier (see also Figure 3). 
We evaluate Si by summing vertically, and Sn + Sm by summing hor- 
izontally. In order to carry this out we need the following simple results. 


Lemma 3.15 For all integers 0 < a < b we have 




(ii) £ *bl = o ( «-‘). 


Proof. This argument is similar to the proof of Proposition 3.4; we 
use summation by parts. Let s n — Ei<Knl(^)> and remember that 
|s n | < q for all n. Then 


y' x( n ) _ Y' s 

Z ^ n l/2 ~ 


n 


- 1/2 _ 


(n + l) -1 / 2 ! +0(a“ 1 / 2 ) 


= 0 E n_3/2 +0(a“ 1/2 ). 


By comparing the sum Y^= a n ~ 3 ^ 2 with the integral of f(x) — x -3 / 2 , 
we find that the former is also 0(a _1 / 2 ). 

A similar argument establishes (ii). 

We may now finish the proof of the proposition. Summing vertically 
we find 


Si 


E 

mKN 1 / 2 


1 

m 1 / 2 


E X(n)/n 1/2 


\N 1 / 2 <n<N/m 


The lemma together with Proposition 3.11 shows that Si = 0(1). Finally 
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we sum horizontally to get 


Sii + Sin 



l<n<N 1 / 2 


£ ^ {2 (N/n) 1 / 2 + c +0((n/N)^)} 


X(n) 



X{n) 


n 1 / 2 


lKnKN 1 / 2 1 <n<iW 2 



A + B + C. 


Now observe that the lemma, together with the definition of L(s,x ), 
implies 


A = 2N 1 / 2 L(1,x) + 0(iV 1 / 2 N“ 1 / 2 ). 


Moreover, part (i) of the lemma gives B = 0(1), and we also clearly 
have C — 0 ( 1 ). Thus Sn = 2N 1 / 2 L(l,x) + 0 ( 1 ), which is part (ii) in 
Proposition 3.13. 

This completes the proof that L( 1, y) 7 ^ 0, and thus the proof of Dirich- 
let’s theorem. 

4 Exercises 

1. Prove that there are infinitely many primes by observing that if there were 
only finitely many, p\, . . . ,Pn, then 



2 . In the text we showed that there are infinitely many primes of the form 4 k + 3 
by a modification of Euclid’s original argument. Adapt this technique to prove 
the similar result for primes of the form 3fc + 2, and for those of the form 6 k + 5. 


3 . Prove that if p and q are relatively prime, then Z*(p) x Z*(q) is isomorphic 
to Z *(pq). 
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4. Let <p(n) denote the number of positive integers < n that are relatively prime 
to n. Use the previous exercise to show that if n and m are relatively prime, 
then 


<p(nm) = tp(n)tp(m). 

One can give a formula for the Euler phi- function as follows: 

(a) Calculate tp(p) when p is prime by counting the number of elements in 

z*(p)- 

(b) Give a formula for p{p k ) when p is prime and k > 1 by counting the 
number of elements in Jj*(jp k ). 

(c) Show that 

i ^ ' 

where Pi are the primes that divide n. 


5. If n is a positive integer, show that 

n = '52<P{d), 

d\n 


where ip is the Euler phi-function. 

[Hint: There are precisely ip(n/d) integers 1 < m < n with gcd(m, n) = d .] 

6. Write down the characters of the groups Z*(3), Z*(4), Z*(5), Z*(6), and 
Z*(8). 

(a) Which ones are real, or complex? 

(b) Which ones are even, or odd? (A character is even if y(— 1) = 1, and odd 
otherwise). 


7. Recall that for |^| < 1, 




fc> l 


>si(rb) _ 


1 - 2 


We have seen that 
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(a) Show that if w = 1/(1 — z), then \z\ < 1 if and only if Re(w) > 1/2. 

(b) Show that if Re(u>) >1/2 and w = pe tip with p > 0, \<p\ < n, then 

log! w = log p + ip. 

[Hint: If = w , then the real part of C is uniquely determined and its 
imaginary part is determined modulo 27T.] 

8. Let C denote the zeta function defined for s > 1. 

(a) Compare £(s) with x~ s dx to show that 

C(s) = — — - + 0(1) as s — > 1+. 
s — 1 

(b) Prove as a consequence that 

Z ps =1 °g(^l) +°(!) as s — > 1 + . 

p ' ' 


9. Let xo denote the trivial Dirichlet character mod q, and pi, ... ,pk the distinct 
prime divisors of q. Recall that L(s , \o) = (1 — p/ s ) •••(! — p\ 7 s )C(s), and show 
as a consequence 


L{s,Xo) +0(1) as 1 + . 

q S — 1 

[Hint: Use the asymptotics for £ in Exercise 8.J 

10 . Show that if i is relatively prime to q, then 

T — = — 7 -T log ( — ] + O(l) as s — > 1 + . 

<p(q) H 3 - 1 / 

This is a quantitative version of Dirichlet’s theorem. 

[Hint: Recall (4).] 

11 . Use the characters for Z*(3), Z*(4), Z*(5), and Z*( 6 ) to verify directly that 
L(l,x) 7 ^ 0 for all non-trivial Dirichlet characters modulo q when q = 3,4,5, 
and 6 . 

[Hint: Consider in each case the appropriate alternating series.] 
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12 . Suppose x real and non-trivial; assuming the theorem that L( l,x) / 0, 
show directly that L( 1,\) > 0. 

[Hint: Use the product formula for L(s,x).] 

13. Let {fln}[JL-cx) be a sequence of complex numbers such that a n = a m if 
n = m mod q. Show that the series 


OO 



n= 1 


converges if and only if Yln=i ttn = *-*■ 
[Hint: Sum by parts.] 

14. The series 


pinO 

F(9) = , for \0\ < 7r, 

\n\^0 

converges for every 9 and is the Fourier series of the function defined on [— it, 7r] 
by F( 0) = 0 and 


m 


i(—n — 9) if — 7r < 9 < 0 

*(7T — 0) if 0 < 6 < 7T, 


and extended by periodicity (period 2-7 t) to all of R (see Exercise 8 in Chapter 2). 
Show also that if 9 ^ 0 mod 27r, then the series 


m = Y 

71=1 


e inB 

n 


converges, and that 


E(9) 



1 


2 — 2 cos 9 


+ \ F ^~ 


15. To sum the series a n /n, with a n = o m if n = m mod q and a n = 

0, proceed as follows. 

(a) Define 


A(m) = Y a n C rnn where C = e 2 "/ ? . 

71 = 1 
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Note that A(q) = 0. With the notation of the previous exercise, prove 
that 

oo q— 1 

= A ( m ) E ( 2 ^m/q). 

n= 1 77i— 1 

[Hint: Use Fourier inversion on Z(g).[ 

(b) If {a m } is odd, (a_ m = — a m ) for me Z, observe that ao = a q = 0 and 
show that 

H(m)= ^ a n (C TO " - C™)- 

l<?i<q , /2 

(c) Still assuming that {a m } is odd, show that 

oo 9-1 

A(m)F(2irm/ q). 

71=1 771=1 

[Hint: Define A(m) = Y^n=i a nC mn and apply the Fourier inversion for- 
mula.] 

16. Use the previous exercises to show that 

7T 11111 

3\/I ~ 1_ 2 + 4 _ 5 + 7~ 8 + '" ’ 
which is L(l,x) for the non-trivial (odd) Dirichlet character modulo 3. 


5 Problems 

1.* Here are other series that can be summed by the methods in Exercise 15. 

(a) For the non-trivial Dirichlet character modulo 6, L(l, y) equals 

71 _ 111 1 

275 “ ~~ 5 + 7~ Tl + 13 + "' ' 

(b) If x is the odd Dirichlet character modulo 8, then L(l, y) equals 

7T 11111 

171 ~~ + 3 _ 5 _ 7 + 9 + 11 ' 

(c) For an odd Dirichlet character modulo 7, L(l,x) equals 

7T _ 1 1 1 1 1 1 

77” + 2 _ 3 + 4 _ 5 _ 6 ' 
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(d) For an even Dirichlet character modulo 8, L( l,x) equals 

log(l + vg) = 1 1 11 1 

y/2 3 5 7 9 11 


(e) For an even Dirichlet character modulo 5, L( l,y) equals 


2 

7 ^ 


log 


(^) 


1111111 1 
2 _ 3 + 4 + 6 _ 7 _ 8 + 9 + TT 


2. Let d(k) denote the number of positive divisors of k. 

(a) Show that if k = p® 1 • • -p“ n is the prime factorization of k, then 

d(k) = (ai + 1) • • • (a n + 1). 

Although Theorem 3.12 shows that on “average” d(k) is of the order of logic, 
prove the following on the basis of (a): 

(b) d(k) = 2 for infinitely many k. 

(c) For any positive integer N, there is a constant c > 0 so that d(k) > 
c(log k) N for infinitely many k. [Hint: Let p ± , . . . , pjv be N distinct primes, 
and consider k of the form (piP 2 • • • Pn ) 171 for m = 1, 2, . . ..] 


3. Show that if p is relatively prime to q, then 


n 




i 

i — pf s 


g 


where g = tp(q)/ f, and / is the order of p in Z*(q) (that is, the smallest n for 
which p n = 1 mod q). Here the product is taken over all Dirichlet characters 
modulo q. 


4. Prove as a consequence of the previous problem that 


X n>l 


where a n > 0, and the product is over all Dirichlet characters modulo q. 
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This appendix is meant as a quick review of the definition and main 
properties of the Riemann integral on M, and integration of appropriate 
continuous functions on IR d . Our exposition is brief since we assume that 
the reader already has some familiarity with this material. 

We begin with the theory of Riemann integration on a closed and 
bounded interval on the real line. Besides the standard results about 
the integral, we also discuss the notion of sets of measure 0, and give 
a necessary and sufficient condition on the set of discontinuities of a 
function that guarantee its integrability. 

We also discuss multiple and repeated integrals. In particular, we 
extend the notion of integration to the entire space by restricting 
ourselves to functions that decay fast enough at infinity. 

1 Definition of the Riemann integral 

Let / be a bounded real-valued function defined on the closed interval 
[a, b] cl. By a partition P of [a, b] we mean a finite sequence of num- 
bers xq, X\, . . . ,xn with 


a = xq < x\ < ■ • • < Xn- i < %n — b. 

Given such a partition, we let Ij denote the interval [xj-i,Xj\ and write 
I j for its length, namely \Ij\ = Xj — Xj-\. We define the upper and 
lower sums of / with respect to P by 



Note that the infimum and supremum exist because by assumption, / 
is bounded. Clearly U(P , /) > £(P, /), and the function / is said to be 
Riemann integrable, or simply integrable, if for every e > 0 there 
exists a partition P such that 

U(PJ)-C(P,f)<e. 

To define the value of the integral of /, we need to make a simple yet 
important observation. A partition P' is said to be a refinement of the 
partition P if P' is obtained from P by adding points. Then, adding one 
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point at a time, it is easy to check that 

U(P'J)<U(P,f ) and C(P',f) > C(PJ). 

From this, we see that if Pi and P 2 are two partitions of [a, 6], 
then 

U(PiJ)>C(P 2 ,f), 

since it is possible to take P' as a common refinement of both Pi and P 2 
to obtain 


U(Pi, f) > U(P', f) > C(P f) > C{P 2 , /). 
Since / is bounded we see that both 


U — inf U(PJ) 


and L — sup C(P, f ) 

p 


exist (where the infimum and supremum are taken over all partitions of 
[a, b\), and also that U > L . Moreover, if / is integrable we must have 
U — L, and we define f(x) dx to be this common value. 

Finally, a bounded complex- valued function f — u + iv is said to be 
integrable if its real and imaginary parts u and v are integrable, and we 
define 

rb rb nb 

/ f(x) dx= u(x) dx + i v(x) dx. 

J a J a J a 

For example, the constants are integrable functions and it is clear that 
if c € C, then cdx — c(b — a). Also, continuous functions are inte- 
grable. This is because a continuous function on a closed and bounded 
interval [a, b] is uniformly continuous, that is, given e > 0 there exists <5 
such that if \x — y\ < 6 then | f(x) — f(y) \ < e. So if we choose n with 
(b — a)/n < 5, then the partition P given by 

b — a b — a b — a 

a, a - 1 , . . . , a + k , . . . , a + (n — 1) , b 

n n n 

satisfies U{P. f) — C(P, f) < e(b — a). 


1.1 Basic properties 

Proposition 1.1 If f and g are integrable on [a, b\, then: 

(i) / + g is integrable, and f{x) + g(x) dx = f(x) dx + f^ g(x) dx. 
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(ii) If c G C, then f b cf(x) dx = cf b f(x) dx. 

(iii) If f and g are real-valued and f(x) < g(x), then f b f(x)dx 
< Ja9{x)dx. 

(iv) If c € [a, b }, then f b f(x) dx = / a c f(x) dx + f b c f(x) dx. 

Proof. For property (i) we may assume that / and g are real-valued. 

If P is a partition of [a, b] 1 then 

U(P,f + g)<U(P,f)+U(P,g) and C(P, f + g) > C(P, /) + C(P, g). 

Given e > 0, there exist partitions P\ and P 2 such that U (Pi, f) — C(P\, f) < 
e and U(P 2 ,g) — C(P 2 ,g) < e, so that if Pq is a common refinement of 
Pi and P 2 , we get 


U(P 0 , f + g) - C(P 0 , f + g) <2e. 

So / + g is integrable, and if we let / = inf P U (P, f + g) — sup P C(P, f + 
g), then we see that 

I < U(P 0 , f + g) + 2e < U(P 0 J) + U(P 0 , g ) + 2e 

pb rb 

< / f(x) dx+ g(x) dx + 4e. 

J a J a 

Similarly / > f b f(x) dx + f b g(x) dx — 4e, which proves that f b f(x) + 
g(x) dx = f b f(x) dx + f b g(x) dx. The second and third parts of the 
proposition are just as easy to prove. For the last property, simply refine 
partitions of [a, b] by adding the point c. 

Another important property we need to prove is that fg is integrable 
whenever / and g are integrable. 

Lemma 1.2 If f is real-valued integrable on [a, b] and ip is a real-valued 
continuous function on K, then ip o f is also integrable on [a, b } . 

Proof. Let e > 0 and remember that / is bounded, say |/| < M. Since 
p is uniformly continuous on we may choose <5 > 0 so that if 

s,t € [— M,M ] and \s — t\ < 5, then |</?(s) — p(t) \ < e. Now choose a par- 
tition P = {xo, ■ ■ ■ , xn} of [a, b] with U(P, f) — C(P, f ) < 5 2 . Let Ij — 
[xj-i,Xj\ and distinguish two classes: we write j £ A if sup xeIj f(x) — 
inf xe i j f(x) < 5 so that by construction 
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Otherwise, we write j € A' and note that 



so Y^jeA' \Ij\ < Therefore, separating the cases j £ A and j £ A' we 
find that 


U(P, tpof)- C(P, p o /) < e{b - a) + 2 BS, 


where B is a bound for ip on [— M,M ]. Since we can also choose 5 < e, 
we see that the proposition is proved. 

^From the lemma we get the following facts: 


• If / and g are integrable on [a, b\, then the product fg is integrable 
on [a, b]. 

This follows from the lemma with pit) = t 2 , and the fact that fg — 
I {if + 9? ~ [f -g} 2 )- 

If / is integrable on [a, b], then the function |/| is integrable, and 
Jaf(x)dx <f*\f(x)\dx. 


We can take pit) = |t| to see that |/| is integrable. Moreover, the in- 
equality follows from (iii) in Proposition 1.1. 

We record two results that imply integrability. 


Proposition 1.3 A bounded monotonic function f on an interval [a, b] 
is integrable. 


Proof. We may assume without loss of generality that a = 0, b = 1, 
and / is monotonically increasing. Then, for each N, we choose the 
uniform partition Pyv given by Xj — j/N for all j — 0, . . . , N. If aj — 
f{xj), then we have 

1 N 1 N 

U{P N ,f) = —^otj and C{P N , f) = — ^ Oj-i- 

3 = 1 3=1 


Therefore, if \f(x)\ < B for all x we have 


U(P N ,f)-C(P N ,f) 


czn — no ^ 2 B 
N “ ~N’ 


and the proposition is proved. 
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Proposition 1.4 Let f be a bounded function on the compact interval 
[a,b]. If c € ( a,b ), and if for all small 5 > 0 the function f is integrable 
on the intervals [a, c — <5] and [c+S,b], then f is integrable on [a,b\. 

Proof. Suppose |/| < M and let e > 0. Choose 5 > 0 (small) so 
that ASM < e/3. Now let P\ and P 2 be partitions of [a,c— 5] and [c + 
5, 5] so that for each i = 1,2 we have U(Pi, f) — C(Pi, f) < e/3. This is 
possible since / is integrable on each one of the intervals. Then by taking 
as a partition P — P\ U {c — <5} U {c + 5} U P 2 we immediately see that 
U(P,f)-£(PJ)<e. 

We end this section with a useful approximation lemma. Recall that 
a function on the circle is the same as a 27r-periodic function on R. 

Lemma 1.5 Suppose f is integrable on the circle and f is bounded by 
B. Then there exists a sequence {f^^i of continuous functions on the 
circle so that 

SU P \fk(x)\ < B for all k = 1,2,..., 

[ — 7T,7r] 


and 


/ | f{x) — fk(x ) | dx — > 0 as k — > 00 . 

J — 7T 


Proof. Assume / is real-valued (in general apply the following argu- 
ment to the real and imaginary parts separately). Given e > 0, we may 
choose a partition — n — < x\ < ■ ■ ■ < xn = tt of the interval [— 7r,7r] 

so that the upper and lower sums of / differ by at most e. Denote by /* 
the step function defined by 


f*(x) = sup f(y) if x € [xj-uxj) for 1 < j < N. 

Xj — i<.y<Xj 

By construction we have |/*| < B, and moreover 

(1) f \f*(x) - f(x)\dx= f (f*(x) - f(x))dx < e. 

J— 7T J —TV 

Now we can modify f* to make it continuous and periodic yet still ap- 
proximate / in the sense of the lemma. For small <5 > 0, let f(x ) = f*(x) 
when the distance of x from any of the division points xq,...,xn is 
> 5. In the 6- neighborhood of Xj for j — 1, . . . , N — 1, define f(x) to be 
the linear function for which f(xj ± S) = f*(xj ± 5). Near x 0 = r, / 
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/* 





I 



Xl X2 X3 xo Xl f X2\X3 

X2 ~ 5 X2 + S 


Figure 1 . Portions of the functions /* and / 


is linear with f(—n) = 0 and /(— ir + 6) = /*(— ir + 5). Similarly, near 
xn — T the function / is linear with f(n) — 0 and /( ir — 5) = f*(n — 5). 
In Figure 1 we illustrate the situation near xo = —ir. In the second pic- 
ture the graph of / is shifted slightly below to clarify the situation. 

Then, since /(— ir) = /(t), we may extend / to a continuous and 27r- 
periodic function on IR. The absolute value of this extension is also 
bounded by B. Moreover, / differs from f* only in the N intervals of 
length 25 surrounding the division points. Thus 



\f*(x) — f(x) \ dx < 2BN(25). 


If we choose 5 sufficiently small, we get 


( 2 ) 


/ \f*{x) - f(x)\dx < e. 

J — 7T 


As a result, equations (1), (2), and the triangle inequality yield 


1/0*0 - /(a 01 dx < 2e - 


Denoting by /*, the / so constructed, when 2e — 1/k, we see that the 
sequence {/*,} has the properties required by the lemma. 


1.2 Sets of measure zero and discontinuities of integrable func- 
tions 

We observed that all continuous functions are integrable. By modifying 
the argument slightly, one can show that all piecewise continuous func- 
tions are also integrable. In fact, this is a consequence of Proposition 1.4 
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applied finitely many times. We now turn to a more careful study of the 
discontinuities of integrable functions. 

We start with a definition 1 : a subset E of IR. is said to have measure 0 
if for every e > 0 there exists a countable family of open intervals {h}kLi 
such that 

a) ec ur=i4, 

(ii) Y^kLi |4c| < e ! where \I k \ denotes the length of the interval I k . 

The first condition says that the union of the intervals covers E, and the 
second that this union is small. The reader will have no difficulty proving 
that any finite set of points has measure 0. A more subtle argument is 
needed to prove that a countable set of points has measure 0. In fact, 
this result is contained in the following lemma. 

Lemma 1.6 The union of countably many sets of measure 0 has mea- 
sure 0. 

Proof. Say E 1: E 2 , ■ . . are sets of measure 0, and let E — U?l 1 E i . Let 
e > 0, and for each i choose open interval 4 , 1 , A, 2 , • • • so that 

OO OO 

E l C (J 4, fe and ]T \I Lk \ < e/2*. 

k= 1 k= 1 

Now clearly we have E C Uffc=i 4,fei and 

OO OO OO 

E£i'«i<£^<e, 

2=1 k= 1 2=1 


as was to be shown. 

An important observation is that if E has measure 0 and is com- 
pact, then it is possible to find a finite number of open intervals I k , 
k — 1, ... ,N, that satisfy the two conditions (i) and (ii) above. 

We can prove the characterization of Riemann integrable functions in 
terms of their discontinuities. 

Theorem 1.7 A bounded function f on [a,b] is integrable if and only if 
its set of discontinuities has measure 0. 


1 A systematic study of the measure of sets arises in the theory of Lebesgue integration, 
which is taken up in Book III. 
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We write J — [a, 6] and I(c,r) = (c — r,c + r) for the open interval 
centered at c of radius r > 0. Define the oscillation of / on I(c,r) by 

osc (f,c,r) = sup |/0) - f(y)\ 

where the supremum is taken over all x, y £ Jn I(c,r). This quantity 
exists since / is bounded. Define the oscillation of / at c by 

osc (/, c) = lim osc (/, c, r). 

r — >0 

This limit exists because osc (/, c, r) is > 0 and a decreasing function of 
r. The point is that / is continuous at c if and only if osc (/, c) = 0. This 
is clear from the definitions. For each e > 0 we define a set A e by 

A e = {c € J : osc (/, c) > e}. 

Having done that, we see that the set of points in J where / is discon- 
tinuous is simply U e >o -^e- This is an important step in the proof of our 
theorem. 

Lemma 1.8 If e > 0, then the set A e is closed and therefore compact. 

Proof. The argument is simple. Suppose c n € A e converges to c 
and assume that c ^ A e . Write osc (/, c) — e - 6 where 6 > 0. Select r 
so that osc (/, c, r) < e — 6/2, and choose n with |c„ — c| < r/2. Then 
osc(/, Cn, r/2) < e which implies osc (/, c n ) < e, a contradiction. 

We are now ready to prove the first part of the theorem. Suppose 
that the set V of discontinuities of / has measure 0, and let e > 0. 
Since A e C T>, we can cover A e by a finite number of open intervals, 
say Ji, . . . , In, whose total length is < e. The complement of this union 
I of intervals is compact, and around each point z in this complement we 
can find an interval F z with sup^. ye p z \ f{x) — f(y)\ < e, simply because 
z fi A e . We may now choose a finite subcovering of U ze jd z , which we 
denote by ijv+i, • • • > In' ■ Now, taking all the end points of the intervals 
h,h, ■ ■ ■ ,In' we obtain a partition P of [a, 6] with 

N 

U[P, f) - C(P, f) < 2 M £ \I :j \ + e(b - a) < Ce. 

3 = 1 

Hence / is integrable on [a, b\, as was to be shown. 

Conversely, suppose that / is integrable on [a,b\, and let V be its 
set of discontinuities. Since V equals Ai/ n , it suffices to prove 
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that each A 1 / n has measure 0. Let e > 0 and choose a partition P — 
{xq,xi, . . . ,xn} so that U{P , /) — C(P, f) < e/n. Then, if Aj/ n inter- 
sects Ij = (xj-i,Xj) we must have swp x€l . f(x) — inf^^ f(x) > 1/n, and 
this shows that 

l - Y, 1^1 <U(PJ)-C(P,f)<e/n. 

71 {r-ij^A i/„#0} 

So by taking intervals intersecting A 1 i n and making them slightly larger, 
we can cover Ai/ n with open intervals of total length < 2e. Therefore 
Aj / n has measure 0, and we are done. 

Note that incidentally, this gives another proof that fg is integrable 
whenever / and g are. 

2 Multiple integrals 

We assume that the reader is familiar with the standard theory of multi- 
ple integrals of functions defined on bounded sets. Here, we give a quick 
review of the main definitions and results of this theory. Then, we de- 
scribe the notion of “improper” multiple integration where the range of 
integration is extended to all of M. d . This is relevant to our study of the 
Fourier transform. In the spirit of Chapters 5 and 6, we shall define the 
integral of functions that are continuous and satisfy an adequate decay 
condition at infinity. 

Recall that the vector space consists of all d-tuples of real numbers 
x = (xi,X 2 , ■ ■ • , Xd) with Xj £ M, where addition and multiplication by 
scalars are defined componentwise. 

2.1 The Riemann integral in 
Definitions 

The notion of Riemann integration on a rectangle R C K. d is an imme- 
diate generalization of the notion of Riemann integration on an interval 
[a, b\ C M. We restrict our attention to continuous functions; these are 
always integrable. 

By a closed rectangle in M d , we mean a set of the form 
R — { a.j < Xj < bj : 1 < j < d} 

where aj,bj £ M for 1 < j < n. In other words, R is the product of the 
one-dimensional intervals [aj, bj]\ 

R= [ai,6i] x x [a d ,b d \. 
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If Pj is a partition of the closed interval [a :n bj], then we call 
P = (P\, . . . , Pd) a partition of R; and if Sj is a subinterval of the 
partition Pj , then S = Si x ■ • • x Sd is a subrectangle of the partition 
P. The volume IS 1 ) of a subrectangle S is naturally given by the product 
of the length of its sides \S\ = Si x • • ■ x \Sd |, where Sj | denotes the 
length of the interval Sj . 

We are now ready to define the notion of integral over R. Given 
a bounded real-valued function / defined on R and a partition P, we 
define the upper and lower sums of / with respect to P by 

u{PJ) = ^[ su p/(t)] \ S \ and £( p J) = '52[ in {f( x )]\ s l 

xes x ^ s 

where the sums are taken over all subrectangles of the partition P. These 
definitions are direct generalizations of the analogous notions in one di- 
mension. 

A partition P' — (P[, . . . , Pj) is a refinement of P = (Pi, . . . , P d ) if 
each Pj is a refinement of Pj. Arguing with these refinements as we did 
in the one-dimensional case, we see that if we define 

U = inf U(P, /) and L = sup C(P, /), 

p P 

then both U and L exist, are finite, and U > L. We say that / is Rie- 
mann integrable on R if for every e > 0 there exists a partition P so 
that 


U(P, f) - C(P, f) < e. 


This implies that U = L, and this common value, which we shall denote 
by either 



x d ) dx i ■ ■ ■ dx d , 


ir 


f(x) dx, 


or 


ir 


f, 


is by definition the integral of / over R. If / is complex- valued, say 
f(x) = u(x) + iv(x ), where u and v are real- valued, we naturally define 


/ f(x) dx= u(x) dx + i v(x) dx. 
Ir Jr Jr 


In the results that follow, we are primarily interested in continuous 
functions. Clearly, if / is continuous on a closed rectangle R then / is 
integrable since it is uniformly continuous on R. Also, we note that if 
/ is continuous on, say, a closed ball B , then we may define its integral 


Ibookroot October 20, 2007 


2. Multiple integrals 291 

over B in the following way: if g is the extension of / defined by g(x) — 0 
if x (fi B, then g is integrable on any rectangle R that contains B , and 
we may set 


IB 


f{x) dx 



2.2 Repeated integrals 

The fundamental theorem of calculus allows us to compute many one 
dimensional integrals, since it is possible in many instances to find an 
antiderivative for the integrand. In R d , this permits the calculation of 
multiple integrals, since a d-dimensional integral actually reduces to d 
one-dimensional integrals. A precise statement describing this fact is 
given by the following. 


Theorem 2.1 Let f be a continuous function defined on a closed rect- 
angle R C M. d . Suppose R= R\X R 2 where R\ C R dl and R 2 C IR d2 
with d — d\ + d 2 . If we write x = ( xi,x 2 ) with Xi £ then F(x i) = 
f R f(x i,x 2 ) dx 2 is continuous on R\, and we have 


IR 


f(x) dx 




dx i- 


Proof. The continuity of F follows from the uniform continuity of / 
on R and the fact that 


\F(xi) - F(x[)\ < I lf(xi,x 2 ) - f(x[,x 2 )\ dx 2 . 
Jr 2 


To prove the identity, let Pi and P 2 be partitions of R\ and R 2 , respec- 
tively. If S and T are subrectangles in Pi and P 2l respectively, then the 
key observation is that 


sup f(x i,x 2 ) > sup 
SxT X! es 


sup f(x i,x 2 ) 
\X 2 er 


inf fix i,x 2 ) < inf 
SxT K J - Xi € S 


inf f(xi,x 2 ) 

X2 GT 


and 


Ibookroot October 20, 2007 


292 


Appendix: INTEGRATION 


Then, 

U(P,f) = Et su P /( x i’ 3 ^)] \ s x T \ 

frf, SxT 

>EE sup [sup f(x I,x 2 )j \T\ x \S\ 

<J rp xi gS X2 ST 

> E sup ( f f{x 1 ,x 2 )dx^\ |5| 

Cj X\(zS i?2 / 

>U (Pi, J f{xi,x 2 )dx2^. 

Arguing similarly for the lower sums, we find that 


C(P,f)<C(P u [ f(x l ,x 2 )dx 2 )<U{P 1 , 

JRi 


f(xi,x 2 )dx 2 ) < U(P,f), 


and the theorem follows from these inequalities. 


Repeating this argument, we find as a corollary that if f is continuous 
on the rectangle R cR d given by R = [«i, bi] x ■ ■ ■ [ad, bd\, then 





f(xi,...,x d )dx d ...dx 2 


dx i, 


where the right-hand side denotes d- iterates of one-dimensional integrals. 
It is also clear from the theorem that we can interchange the order of 
integration in the repeated integral as desired. 


2.3 The change of variables formula 


A diffeomorphism of class C 1 * , g : A —>■ B, is a mapping that is contin- 
uously differentiable, invertible, and whose inverse g -1 : B — > A is also 
continuously differentiable. We denote by Dg the Jacobian or derivative 
of g. Then, the change of variables formula says the following. 


Theorem 2.2 Suppose A and B are compact subsets of and 
g : A — ► B is a diffeomorphism of class C 1 . If f is continuous on B, 
then 



I f(9(y))\det(Dg)(y)\dy. 
J A 


The proof of this theorem consists first of an analysis of the special 
situation when g is a linear transformation L. In this case, if R is a 
rectangle, then 


\d(R)\ — I det(L)| |i?J 
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which explains the term | det(-Dg)). Indeed, this term corresponds to the 
new infinitesimal element of volume after the change of variables. 


2.4 Spherical coordinates 

An important application of the change of variables formula is to the case 
of polar coordinates in R 2 , spherical coordinates in R 3 , and their general- 
ization in R . These are particularly important when the function, or set 
we are integrating over, exhibit some rotational (or spherical) symme- 
tries. The cases d = 2 and d = 3 were given in Chapter 6 . More generally, 
the spherical coordinates system in M d is given by x — g(r, 9\, , O^-i) 
where 


Xl 

= r sin 9\ sin 9 2 ■ 

■ ■ sin 9d - 2 cos 9d~ 

X 2 

= r sin 9\ sin 9 2 ■ 

■ - sin 9 d - 2 sin 9 d - 

Xd - 1 

= r sin 9i sin 9 2 , 


Xd 

= r cos 8\ , 



with 0 < 9i < tt ior 1 < i < d — 2 and 0 < Od-i < 27r. The determinant 
of the Jacobian of this transformation is given by 

r d ~ l sin d-2 9\ sin d ~ 3 9 2 ■ ■ ■ sin 9d- 2 - 


Any point in x € R 0 ' — {0} can be written uniquely as ry with 7 G S d 1 
the unit sphere in M d . If we define 

/ /( 7 ) da^) = 

Js d - 1 


(-271- 


f{g{r,6)) sin d 2 6 >i sin d 3 6 2 ■ ■ ■ sin 9 d -2 dO d -i ■ ■ ■ d9 ± , 


d - 3 


10 JO 


then we see that if 5(0, N) denotes the ball of radius N centered at the 
origin, then 


( 3 ) 




dr da(j). 


In fact, we define the area of the unit sphere S d 1 C M d as 


Ud — 



An important application of spherical coordinates is to the calculation 
of the integral f A r Rl Rl \ \x\ x dx, where A(Ri,R 2 ) denotes the annulus 
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A(R\, R 2 ) 
we find 


{Ri < |*| < R 2 } and AeR. Applying polar coordinates, 


\x\ x dx = 


/A(Ri,R 2 ) 




r A+d_1 drdaij). 


Therefore 


Ia(r 1 ,r 2 ) 



J±L_[R$+ d - R$+ d ] if A jL-d, 
w d [log(i? 2 ) -log(iZi)] if A = —d. 


3 Improper integrals. Integration over 

Most of the theorems we just discussed extend to functions integrated 
over all of once we impose some decay at infinity on the functions we 
integrate. 


3.1 Integration of functions of moderate decrease 

For each fixed N > 0 consider the closed cube in centered at the origin 
with sides parallel to the axis, and of side length N: Qn = {\xj\ < N/2 : 
1 < j < d}. Let / be a continuous function on If the limit 


lim 

N— >00 



exists, we denote it by 

[ f(x)dx. 

JM. d 


We deal with a special class of functions whose integrals over exist. 
A continuous function f on M. d is said to be of moderate decrease if 
there exists A > 0 such that 


1 / 0*01 < 


A 

1 + |*|‘ i+1 ’ 


Note that if d — 1 we recover the definition given in Chapter 5. An 
important example of a function of moderate decrease in 1R is the Poisson 
kernel given by V y (x) = 

We claim that if / is of moderate decrease, then the above limit exists. 
Let In = Jq n f(x)dx. Each In exists because / is continuous hence 
integrable. For M > N, we have 


\Im — In\ < 


' Qm — Qn 


1/0*01 dx. 
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Now observe that the set Qm ~ Qn is contained in the annulus 
A(aN,bM) — {aN < \x\ < bM}, where a and b are constants that de- 
pend only on the dimension d. This is because the cube Qn is contained 
in the annulus N/2 < |x| < Ny/d/2, so that we can take a — 1/2 and 
b = \fd/ 2. Therefore, using the fact that / is of moderate decrease yields 

| Im ~ In\ < A [ |x| _d_1 dx. 

JaN<\x\<bM 

Now putting A = — d — 1 in the calculation of the integral of the previous 
section, we find that 

Vm “ ,n] - 0 (sv “ m ) ■ 

So if / is of moderate decrease, we conclude that {I n }n=i is a Cauchy 
sequence, and therefore f Rd f(x) dx exists. 

Instead of the rectangles Qn, we could have chosen the balls B N cen- 
tered at the origin and of radius N. Then, if / is of moderate decrease, 
the reader should have no difficulties proving that limjv->oo / B f(x) dx 
exists, and that this limit equals limjv-»oo Jq v f( x ) dx. 

Some elementary properties of the integrals of functions of moderate 
decrease are summarized in Chapter 6. 

3.2 Repeated integrals 

In Chapters 5 and 6 we claimed that the multiplication formula held for 
functions of moderate decrease. This required an appropriate interchange 
of integration. Similarly for operators defined in terms of convolutions 
(with the Poisson kernel for example). 

We now justify the necessary formula for iterated integrals. We only 
consider the case d = 2, although the reader will have no difficulty ex- 
tending this result to arbitrary dimensions. 

Theorem 3.1 Suppose f is continuous on R 2 and of moderate decrease. 
Then 

F(xi) = / f(x 1 ,x 2 )dx 2 
Jr 


is of moderate decrease on R, and 
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Proof. To see why F is of moderate decrease, note first that 
r Adx 2 r '' 




< 


1 + (xf + X2) 3 / 2 */|aS2|<|a;i| J |x 2 [>|xi| 

In the first integral, we observe that the integrand is<A/(l + |xi| 3 ),so 
f A dx 9 A f A! 


< 


•/|x 2 |<|aii| 1 + (xl + xj ) 3 / 2 1 + l^il 3 J|x 2 |<|a :i | 

For the second integral, we have 

f A dx-2 ... f dx 2 


dx 2 < 


1 + |xi| 2 ' 


< A 


< 


A " 


^|x 2 |>ki| 1 + l^l 3 |^i | 2 ’ 


J\x a \>\xi\ 1 + (x 2 +X 2 ) 3 / 2 

thus F is of moderate decrease. In fact, this argument together with 
Theorem 2.1 shows that F is the uniform limit of continuous functions, 
thus is also continuous. 

To establish the identity we simply use an approximation and Theo- 
rem 2.1 over finite rectangles. Write S c to denote the complement of a 
set S. Given e > 0 choose N so large that 


/ f(xi,x2)dx\dx2— / f(x\,x2)dx\dx 

J R 2 j I J\T X I M 


< e, 


where In — [—77, N], Now we know that 


/ f(xi,x 2 )dx 1 dx 2 = (/ f(x 1 ,X2)dx 2 )dx 1 . 

J I n x I n ^ In \^In / 


'In x In J In \J In 

But this last iterated integral can be written as 


/ / f(x 1 ,X2)dx 2 ) dx 1- / / f(xi,X2)dx 2 ) dx 1 

JM \J K / Jlfj \J R / 

- / f(xi,x 2 ) dx 2 dx 1. 

/ 


We can now estimate 


(/, f(xi,x 2 )dx^J dx 1 < O ^-^2 


C 


dx-2 


l<|xi|<A W|x 2 |>IV ( I + |yi |) 3 


dxi 


SOI iV 


Ibookroot October 20, 2007 


3. Improper integrals. Integration over M. d 


297 


A similar argument shows that 

/ (/ fi x i’ x z) dx 2J dxi 


“ N 


Therefore, we can find N so large that 

/ f(x 1 ,x 2 )dx 1 dx 2 - / ( / f(xi,x 2 )dx 2 \ dx x 

J InxIn •'R \«/R / 


< e, 


and we are done. 


3.3 Spherical coordinates 

In R d , spherical coordinates are given by x = ry, where r > 0 and 7 
belongs to the unit sphere If / is of moderate decrease, then 

for each fixed 7 € S d_1 , the function of / given by is also of 

moderate decrease on M. Indeed, we have 


fin) 


d - 1 


„d - 1 


< A 


< 


B 


1 + | r '7| rf + 1 1 + 


As a result, by letting R — > 00 in (3) we obtain the formula 




dr da ( 7 ). 


As a consequence, if we combine the fact that 


f f(R(x)) dx = 



/(x) dx, 


whenever I? is a rotation, with the identity (3), then we obtain that 
(4) f f(R(l)) dai'y) = [ /(y) dcr( 7 ) . 

./ S '*- 1 
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Seeley [29] gives an elegant and brief introduction to Fourier series and the 
Fourier transform. The authoritative text on Fourier series is Zygmund [36]. 
For further applications of Fourier analysis to a variety of other topics, see Dym 
and McKean [8] and Korner [21]. The reader should also consult the book by 
Kahane and Lemarie-Rieusset [20], which contains many historical facts and 
other results related to Fourier series. 

Chapter 1 

The citation is taken from a letter of Fourier to an unknown correspondent 
(probably Lagrange), see Herivel [15]. 

More facts about the early history of Fourier series can be found in Sections 
I-III of Rienrann’s memoir [27]. 

Chapter 2 

The quote is a translation of an excerpt in Riemann’s paper [27]. 

For a proof of Littlewood’s theorem (Problem 3), as well as other related 
“Tauberian theorems,” see Chapter 7 in Titchmarsh [32]. 

Chapter 3 

The citation is a translation of a passage in Dirichlet’s memoir [6]. 

Chapter 4 

The quote is translated from Hurwitz [17]. 

The problem of a ray of light reflecting inside a square is discussed in Chap- 
ter 23 of Hardy and Wright [13]. 

The relationship between the diameter of a curve and Fourier coefficients 
(Problem 1) is explored in Pfluger [26]. 

Many topics concerning equidistribution of sequences, including the results in 
Problems 2 and 3, are taken up in Kuipers and Niederreiter [22]. 

Chapter 5 

The citation is a free translation of a passage in Schwartz [28]. 

For topics in finance, see Duffie [7] , and in particular Chapter 5 for the Black- 
Scholes theory (Problems 1 and 2). 

The results in Problems 4, 5, and 6 are worked out in John [19] and Wid- 
der [34]. 

For Problem 7, see Chapter 2 in Wiener [35]. 

The original proof of the nowhere differentiability of fi (Problem 8) is in 
Hardy [12]. 

Chapter 6 

The quote is an excerpt from Cornrack’s Nobel Prize lecture [5] . 

More about the wave equation, as well as the results in Problems 3, 4, and 5 
can be found in Chapter 5 of Folland [9] . 
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A discussion of the relationship between rotational symmetry, the Fourier 
transform, and Bessel functions is in Chapter 4 of Stein and Weiss [31]. 

For more on the Radon transform, see Chapter 1 in John [18], Helgason [14], 
and Ludwig [25]. 

Chapter 7 

The citation is taken from Bingham and Tukey [2]. 

Proofs of the structure theorem for finite abelian groups (Problem 2) can be 
found in Chapter 2 of Herstein [16], Chapter 2 in Lang [23], or Chapter 104 in 
Korner [21]. 

For Problem 4, see Andrews [1], which contains a short proof. 

Chapter 8 

The citation is from Bochner [3]. 

For more on the divisor function, see Chapter 18 in Hardy and Wright [13]. 
Another “elementary” proof that L( l,x) ^ 0 can be found in Chapter 3 of 
Gelfond and Linnik [11]. 

An alternate proof that L( l,y) ^ 0 based on algebraic number theory is in 
Weyl [33]. Also, two other analytic variants of the proof that L(l,x) 0 can be 
found in Chapter 109 in Korner [21] and Chapter 6 in Serre [30]. See also the 
latter reference for Problems 3 and 4. 

Appendix 

Further details about the results on integration reviewed in the appendix can 
be found in Folland [10] (Chapter 4), Buck [4] (Chapter 4), or Lang [24] (Chap- 
ter 20). 
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Symbol Glossary 


The page numbers on the right indicate the first time the symbol or 
notation is defined or used. As usual, Z, Q, M and C denote the integers, 
the rationals, the reals, and the complex numbers respectively. 


A 

Laplacian 


20, 

185 

\z , z 

Absolute value and complex 
conjugate 



22 

e z 

Complex exponential 



24 

sinhx, coshx 

Hyperbolic sine and hyperbolic 



28 


cosine 




fin), a n 

Fourier coefficient 



34 

f{9)~Y,a n e ine 

Fourier series 



34 

SnU) 

Partial sum of a Fourier series 



35 

Dn, Vr, d Ni d* n 

Dirichlet kernel, conjugate, and 
modified 

37 

, 95, 

165 

Pr,Vy, V { y d) 

Poisson kernels 

37, 

149, 

210 

0, 0 

Big 0 and little o notation 


42 

, 62 

c k 

Functions that are k times dif- 
ferentiable 



44 

f * 9 

Convolution 

44, 139, 

184, 

239 

<Jn, n N (f) 

Cesaro mean 


52 

, 53 

Fn, Fr 

Fejer kernel 


53, 

163 

A{r), A r {f) 

Abel mean 


54 

, 55 

X[a,b\ 

Characteristic function 



61 

f(9 + ), f(9~) 

One-sided limits at jump dis- 
continuities 



63 

R d , C d 

Euclidean spaces 



71 

X 1 Y 

Orthogonal vectors 



72 

e 2 (z) 

Square summable sequences 



73 

77 

Riemann integrable functions 



75 

CM 

Zeta function 



98 

[*], <*> 

Integer and fractional parts 



106 

Ajv, ctn,k, 

Delayed means 

114, 

127, 

174 

TT r U 'lj(d) 

■tit, tit, tl t 

Heat kernels 

120, 

146, 

209 

M(R) 

Space of functions of moderate 
decrease on M 



131 
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m 

Fourier transform 

134, 

181 

S,S(R),S(R d ) 

Schwartz space 

134, 

180 

r 2 ., r 2 !' 

Upper half-plane and its 
closure 


149 

i?(s), 0 {z\t) 

Theta functions 

155, 

156 

ns) 

Gamma function 


165 

INI, |z|; (x,y), x ■ y 

Norm and inner product in 
R d 

71, 

176 

l“l. (icT 

Monomial, its order, and 
differential operator 


176 

s\ s 2 , S d - 1 

Unit circle in R 2 , and unit 
spheres in R 3 , R d 

179, 

180 

M t , M t 

Spherical mean 

194, 

216 

Jn 

Bessel function 

197, 

213 

~P, Vtfl 

Plane 


202 

77, 77* 

Radon and dual Radon 
transforms 

203, 

205 

A d , V d 

Area and volume of the 
unit sphere in R d 


208 

Z (N) 

Group of 7V th roots of 
unity 


219 

Z/7VZ 

Group of integers modulo 
N 


221 

G, \G\ 

Abelian group and its or- 
der 

226, 

228 

G^H 

Isomorphic groups 


227 

G\ x G 2 

Direct product of groups 


228 

Z*(q) 

Group of units modulo q 

227, 229, 

244 

G 

Dual group of G 


231 

a\b 

a divides b 


242 

gcd (a, 6) 

Greatest common divisor 
of a and b 


242 

¥>(?) 

Number of integers rela- 
tively prime to q 


254 

X. Xo 

Dirichlet character, and 
trivial Dirichlet character 


254 

L(s,x) 

Dirichlet T-function 


256 

lo gi (rb)> lo g2^(s 5 x) 

Logarithms 

258, 264 

d(k) 

Number of positive divi- 
sors of k 


269 
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Relevant items that also arise in Book I are listed in this index, preceeded 
by the numeral I. 


Abel 

means, 54 
summable, 54 
abelian group, 226 
absolute value, 23 
absorption coefficient, 199 
amplitude, 3 

annihilation operator, 169 
approximation to the identity, 
49 

attenuation coefficient, 199 

Bernoulli 

numbers, 97, 167 
polynomials, 98 
Bernstein’s theorem, 93 
Bessel function, 197 
Bessel’s inequality, 80 
best approximation lemma, 78 
Black-Scholes equation, 170 
bump functions, 162 

Cauchy problem (wave equa- 
tion), 185 

Cauchy sequence, 24 
Cauchy-Schwarz inequality, 72 
Cesaro 

means, 52 
sum, 52 
summable, 52 
character, 230 

trivial (unit), 230 
class C k , 44 


closed rectangle, 289 
complete vector space, 74 
complex 

conjugate, 23 
exponential, 24 
congruent integers, 220 
conjugate Dirichlet kernel, 95 
convolution, 44, 139, 239 
coordinates 

spherical in U. d , 293 
creation operator, 169 
curve, 102 

area enclosed, 103 
closed, 102 
diameter, 125 
length, 102 
simple, 102 

d’Alembert’s formula, 11 
delayed means, 114 
generalized, 127 
descent (method of), 194 
dilations, 133, 177 
direct product of groups, 228 
Dirichlet characters, 254 
complex, 265 
real, 265 
Dirichlet kernel 

conjugate (on the circle), 95 
modified (on the real line), 
165 

on the circle, 37 
Dirichlet problem 
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annulus, 64 
rectangle, 28 
strip, 170 
unit disc, 20 

Dirichlet product formula, 256 
Dirichlet’s test, 60 
Dirichlet ’s theorem, 128 
discontinuity 
jump, 63 

of a Riemann integrable 
function, 286 

divisibility of integers, 242 
divisor, 242 

greatest common, 242 
divisor function, 269, 280 
dual 

X-ray transform, 212 
group, 231 

Radon transform, 205 

eigenvalues and eigenvectors, 
233 

energy, 148, 187 
of a string, 90 

equidistributed sequence, 107 
ergodicity, 111 
Euclid’s algorithm, 241 
Euler 

constant 7, 268 
identities, 25 
phi-function, 254, 276 
product formula, 249 
even function, 10 
expectation, 160 
exponential function, 24 
exponential sum, 112 

fast Fourier transform, 224 
Fejer kernel 

on the circle, 53 
on the real line, 163 
Fibonacci numbers, 122 


Fourier 

coefficient (discrete), 236 
coefficient, 16, 34 
on Z{N), 223 
on a finite abelian group, 
235 

series, 34 

sine coefficient, 15 
Fourier inversion 

finite abelian group, 235 
on Z(N), 223 
on M, 141 
on R d , 182 

Fourier series, 34, 235 
Abel means, 55 
Cesaro mean, 53 
delayed means, 114 
generalized delayed means, 
127 

lacunary, 114 
partial sum, 35 
uniqueness, 39 
Fourier series convergence 
mean square, 70 
pointwise, 81, 128 
Fourier series divergence, 83 
Fourier transform, 134, 136, 181 
fractional part, 106 
function 

Bessel, 197 
exponential, 24 
gamma, 165 

moderate decrease, 131, 
179, 294 
radial, 182 

rapidly decreasing, 135, 178 
sawtooth, 60, 83 
Schwartz, 135, 180 
theta, 155 
zeta, 98 

gamma function, 165 
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Gaussian, 135, 181 
Gibbs’s phenomenon, 94 
good kernels, 48 
greatest common divisor, 242 
group (abelian), 226 
cyclic, 238 
dual, 231 

homomorphism, 227 
isomorphic, 227 
of integers modulo TV, 221 
of units, 227, 229, 244 
order, 228 

Holder condition, 43 
harmonic function, 20 

mean value property, 152 
harmonics, 13 
heat equation, 20 

d-dimensions, 209 
on the real line, 146 
steady-state, 20 
time-dependent, 20 
heat kernel 

d-dimensions, 209 
of the real line, 146, 156 
of the circle, 120, 124, 156 
Heisenberg uncertainty princi- 
ple, 158, 168, 209 

Hermite 

functions, 173 
operator, 168 

Hermitian inner product, 72 
Hilbert space, 75 
homomorphism, 227 
Hooke’s law, 2 
Huygens’ principle, 193 
hyperbolic cosine and sine func- 
tions, 28 

hyperbolic sums, 269 

inner product, 71 
Hermitian, 72 
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strictly positive definite, 71 
integer part, 106 
integrable function (Riemann), 
31, 281 

inverse of a linear operator, 177 
isoperimetric inequality, 103, 
122 

jump discontinuity, 63 

Kirchhoff’s formula, 211 

L-function, 256 
Landau kernels, 164 
Laplace operator, 20 
Laplacian, 20, 149, 185 
polar coordinates, 27 
Legendre 

expansion, 96 
polynomial, 95 
light cone 

backward, 193, 213 
forward, 193 
linearity, 6, 22 
Lipschitz condition, 82 
logarithm 

logi, 258 
log 2 , 264 

mean value propery, 152 
measure zero, 287 
moderate decrease (function) , 
131 

modulus, 23 
monomial, 176 
multi- index, 176 
multiplication formula, 140, 183 

natural frequency, 3 
Newton’s law, 3 
of cooling, 19 

nowhere differentiable function, 
113, 126 
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odd function, 10 
order of a group, 228 
orthogonal elements, 72 
orthogonality relations, 232 
orthonormal family, 77 
oscillation (of a function), 288 
overtones, 6, 13 

par ametriz ation 
arc-length, 103 
reverse, 121 
Parseval’s identity, 79 

finite abelian group, 236 

part 

fractional, 106 
integer, 106 
partition 

of a rectangle, 290 
of an interval, 281 
period, 3 

periodic function, 10 
periodization, 153 
phase, 3 

Plancherel formula 
on Z(N), 223 
on R, 143 
on R d , 182 

Planck’s constant, 161 
plucked string, 17 
Poincare’s inequality, 90 
Poisson integral formula, 57 
Poisson kernel 

d-dimensions, 210 
on the unit disc, 37, 55 
on the upper half-plane, 149 
Poisson kernels, comparison, 157 
Poisson summation formula, 
154-156, 165, 174 
polar coordinates, 179 
polynomials 

Bernoulli, 98 
Legendre, 95 


INDEX 

pre-Hilbert space, 75 
prime number, 242 
primes in arithmetic progres- 
sion, 245, 252, 275 
probability density, 160 
profile, 5 
pure tones, 6 
Pythagorean theorem, 72 

Radon transform, 200, 203 
rapid decrease, 135 
refinement (partition), 281, 290 
relatively prime integers, 242 
repeated integrals, 295 
Reuleaux triangle, 125 
Riemann integrable function, 31, 
281, 290 

Riemann localization principle, 
82 

Riemann-Lebesgue lemma, 80 
root of unity, 219 
rotation, 177 

improper, 177 
proper, 177 

sawtooth function, 60, 83, 84, 
94, 99, 278 
scaling, 8 

Schwartz space, 134, 180 
separation of variables, 4, 11 
simple harmonic motion, 2 
space variables, 185 
spectral theorem, 233 

commuting family, 233 
speed of propagation (finite), 
194 

spherical 

coordinates in R d , 293 
mean, 189 
wave, 210 
spring constant, 3 
standing wave, 4 
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subordination principle, 210 
subrectangle, 290 
summable 
Abel, 54 
Cesaro, 52 

summation by parts, 60 
superposition, 6, 14 
symmetry-breaking, 83 

theta function, 155 

functional equation, 155 

tone 

fundamental, 13 
pure, 11 

translations, 133, 177 
transpose of a linear operator, 
177 

traveling wave, 4 
trigonometric 

polynomial, 35 
degree, 35 
series, 35 

Tychonoff’s uniqueness theo- 
rem, 172 

uncertainty, 160 
unit, 229 

unitary transformation, 143, 233 
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variance, 160 
vector space, 70 
velocity of a wave, 5 
velocity of the motion, 7 
vibrating string, 90 

wave 

standing, 4, 13 
traveling, 4 
velocity, 5 
wave equation, 184 
d-dimensional, 185 
d’Alembert’s formula, 11 
is linear, 9 
one-dimensional, 7 
time reversal, 11 
Weierstrass approximation theo- 
rem, 54, 63, 144, 163 

Weyl 

criterion, 112, 123 
estimate, 125 
theorem, 107 

Wirtinger’s inequality, 90, 122 

X-ray transform, 200 

zeta function, 98, 155, 166, 248 


